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Attached hereto are the following: 



1 . a supplemental information disclosure statement; 

2. copies of non-patent references cited in the supplemental information disclosure 



3. a Statement in Support of Petition to Make Special with attachment Items 1-8; 



4. a check in the amount of $130.00 for the petition fee under 37 C.F.R. § L17(h). 

It is respectfully submitted that all of the requirements of 37 C.F.R. § 1.102(d), 37 
C.F.R. § 1.17(h), and MPEP § 708.02(VIII) have been satisfied. Accordingly, it is 
respectfully requested that this Petition to Make Special be granted and that an 
accelerated examination of the above-identified application be ordered. 

No fee other than the enclosed $130.00 petition fee under 37 C.F.R. § 1.17(h) is 
believed to be due in connection with this Petition. If any additional fees are due, or an 
overpayment has been made, please charge, or credit, our Deposit Account No. 1 1-0171 
for such sum. 

If the Examiner has any questions regarding the present Petition or the subject 
application, the Examiner is cordially invited to contact Applicants' attorney at the 
telephone number provided below. 



statement; 



and 



Respectfully submitted, 




Tor Smeland 
Registration No.: 43,13 1 
Attorney for Applicants 



Kalow & Springut LLP 
Telephone No.: (212)813-1600 
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1 . Pursuant to the enclosed petition under 37 C.F.R. §1.1 02(d) and MPEP 
708.02(VIII), Applicants petition to make special the above-captioned 
application, and provide this Statement in support of the enclosed Petition to 
Make Special. 

2. This is a new application, filed November 14, 2003, which has not undergone any 
examination by an Examiner. 



3. The following items are attached to this Statement: 

a. A paper copy of the application as filed, along with a copy of a CD-ROM 
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disk containing tables related to nucleic acid listings, which CD-ROM 
copy was filed with the paper portion of the application on November 14, 
2003 as Item 1; 

b. a copy of the Preliminary Amendment amending tables in the paper copy 
of the application, filed May 20, 2004, as Item 2; 

c. a copy of the Preliminary Amendment directing entry of the sequence 
listing, filed June 30, 2004, as Item 3; 

d. a copy of the Supplemental Preliminary Amendment canceling claims 9- 
18, filed by fax on April 21, 2005, as Item 4; 

e. a copy of the International Search Report for PCT/US2003/036787, as 
Item 5; 

f a copy of published International Patent Application 
PCT/US2003/036787, as Item 6; 

g. a copy of the information disclosure statement filed on March 7, 2005, as 
Item 7; and, 

h. a copy of the information disclosure statement filed on January 3 1 , 2005, 
as Item 8. 

4. A supplemental information disclosure statement is filed herewith and enclosed 
with this mailing, and includes copies of the non-patent references cited in the 
supplemental information disclosure statement. 

5. A preexamination search was made on this application. The preexamination 
search is attached to this Statement as Item 5, which is an International Search 
Report issued by the U.S. Patent and Trademark Office (USPTO), acting as an 
International Search Authority, on February 25, 2005 in connection with 
International Patent Application No. PCT/US2003/036787, filed November 14, 
2003, published as WO 2004/045543 A2 on June 3, 2004. A copy of publication 
No. WO 2004/045543 A2 is attached as Item 6. The disclosure and claims of 
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International Patent Application No. PCT/US2003/036787 are the same as the 
disclosure and claims for this application for which special status is requested, 
with the exception of the attached preliminary amendments made in the instant 
application. 

6. A description of the preexamination search can be found in the International 
Search Report attached as Item 5, and summarized below: 

Classification of Subject Matter 

IPC(7): C12Q 1/68; C07H 21/00 

US CL: 435/6; 536/24.5 

Fields Searched 

US: 435/6; 536/24.5 

Electronic Data Base Consulted During the International Search 
CAplus, Medline, Blosis, USPatfuU, Derwent, JPO, EPO 
Search Terms: (SIRNA OR RNAI OR DSRNA) and (OPTIMIZATION 

OR OPTIMIXZE OR OPTIMAL) and ALGORITHM 

7. All claims in this application are directed to a single invention. 

8. No unity of invention objection has been made to date in PCT/US2003/036787. 

9. One copy of each of the references cited in the International Search Report, and a 
copy of the Intemational Search Report, are transmitted herewith. 

10. A discussion of each of the references cited in the Intemational Search Report is 
provided below. Each of the references cited in the Search Report were 
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categorized as "A" references. The discussion points out how the claimed subject 
matter in this application is patentable over each reference. 

WO 03/064625 bv Woolf and Taylor 

11. Applicants submitted International Patent Application Publication No. WO 
03/064625 to Woolf and Taylor ("Woolf) by information disclosure statement 
dated January 31, 2005. 

12. The pending claims of the instant application are patentable over Woolf. 
Woolf — alone or in combination with any other reference herein — does not 
disclose, teach, or suggest any of the pending claims. 

13. Woolf discloses single and double stranded oligonucleotides for inhibiting gene 
expression. Woolf discloses an oligonucleotide composition having at least 3 
different oligonucleotides targeted to at least three different nucleotide sequences 
within a target gene (see for example, page 1, lines 28-30). Woolf discloses that 
an oligonucleotide composition can have from about 2 oligonucleotides to greater 
than about 20 oligonucleotides (see, for example, page 26, lines 1-3); that the 
oligonucleotides can be present at varying concentrations (page 26, lines 11-13); 
and that they are preferably all double stranded (page 26, lines 16-17). Woolf 
discloses that the composition inhibits gene expression to an extent that is greater 
than the level of inhibition of gene expression achieved by any of the individual 
oligonucleotides of the composition acting alone (page 27, lines 13-15). 
Regarding selection of double stranded RNAs, Woolf discloses that they should 
have at least about 60% identity with a target gene (page 18, lines 23-25), and 
discloses certain alignment criteria for determining identity of a double stranded 
RNA and a target (page 19). 

14. In contrast, claim 1 of the present application recites a method for selecting 
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siRNA comprising selecting an siRNA of 19-25 nucleoside bases. The method 
comprises selecting a target gene and measuring the functionality of sequences 
19-25 nucleotides in length that are substantially complementary to a stretch of 
nucleotides in a target sequence, wherein the functionality is dependent upon non- 
target specific criteria, as recited in claim 1 of the instant application. Among 
other things, Woolf does not disclose functionality dependent upon non-target 
specific criteria, as recited in claim 1. Accordingly, Woolf does not disclose, 
teach, or suggest claim 1 . Claims 2-5 depend on claim 1 , and thus for at least the 
reasons that claim 1 is patentable over the cited reference, claims 2-5 are 
patentable over it as well. 

15. Claim 6 of the present application recites a method for developing an siRNA 
algorithm for selecting siRNA. The method comprises, among other things, 
determining the amount of improved functionality of an siRNA by the presence or 
absence of at least one of a number of specified variables. These variables are not 
disclosed in Woolf Accordingly, Woolf does not disclose, teach, or suggest 
claim 6. Claims 7-8 depend on claim 6, and thus, for at least the reasons that 
claim 6 is patentable over the cited reference, claims 7-8 are patentable over it as 
well. 

16. Claim 19 recites a kit comprising at least two siRNA, wherein the at least two 
siRNA comprise a first optimized siRNA and a second optimized siRNA, wherein 
the siRNA are optimized according to recited formulas. Woolf does not disclose, 
teach, or suggest the recited formulas. Accordingly, Woolf does not disclose, 
teach, or suggest claim 19. 

U.S. Patent Application No. 2002/0150945 Al bv Finney et aL 

17. The Search Report categorized U.S. Patent Application No. 2002/01 50945 A 1 
("Finney") as an "A" reference, specified pages 27-28, and indicated relevance to 
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claims 1-19 of this application. Claims 9-18 have been canceled by the 
Supplemental Preliminary Amendment filed April 22, 2005 (Item 4, attached). 

18. The pending claims of the instant application are patentable over Finney, 
Finney — alone or in combination with any other reference in the Search Report — 
does not disclose, teach, or suggest any of the pending claims. 

19. Finney alleges disclosure of, inter alia, dsRNA libraries whereby each dsRNA is 
capable of reducing the level of expression of a specific gene (see Finney at 
paragraph [0322], page 27). Finney alleges that more than one "RNAi molecule" 
may be administered simultaneously or sequentially to a subject or to cells in 
vitro, asserting that dsRNAs designed to different sequences or regions of a gene 
can be pooled and administered as one formulation; alternatively, a formulation 
can comprise dsRNAs that target the mRNA transcripts of different genes (see 
paragraph [0038], page 28). 

20. In contrast, claim 1 of the present application recites a method for selecting 
siRNA comprising selecting an siRNA of 19-25 nucleoside bases. The method 
comprises selecting a target gene and measuring the functionality of sequences 
19-25 nucleotides in length that are substantially complementary to a stretch of 
nucleotides in a target sequence, wherein the functionality is dependent upon non- 
target specific criteria, as recited in claim 1 of the instant application. Among 
other things, Finney does not disclose functionality dependent upon non-target 
specific criteria, as recited in claim 1 . Accordingly, Finney does not disclose, 
teach, or suggest claim 1 . Claims 2-5 depend on claim 1 , and thus for at least the 
reasons that claim 1 is patentable over the cited reference, claims 2-5 are 
patentable over it as well. 



21 . Claim 6 of the present application recites a method for developing an siRNA 
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algorithm for selecting siRNA. The method comprises, among other things, 
determining the amount of improved functionality of an siRNA by the presence or 
absence of at least one of a number of specified variables. These variables are not 
disclosed in Finney. Accordingly, Finney does not disclose, teach, or suggest 
claim 6. Claims 7-8 depend on claim 6, and thus, for at least the reasons that 
claim 6 is patentable over the cited reference, claims 7-8 are patentable over it as 
well. 

22. Claim 19 recites a kit comprising at least two siRNA, wherein the at least two 
siRNA comprise a first optimized siRNA and a second optimized siRNA, wherein 
the siRNA are optimized according to recited formulas. Finney does not disclose, 
teach, or suggest the recited formulas. Accordingly, Finney does not disclose, 
teach, or suggest claim 19. 

Kasif et al (2002) A computational framework for optimal masking in the 
synthesis of oligonucleotide niicroarravs> Nucleic Acids Research^ 30/20:el06 

23. The Search Report categorized Kasif et a/. ("Kasif ) as an "A" reference, specified 
the entire article, and indicated relevance to claims 1-19 of this application. 
Claims 9-18 have been canceled by the Supplemental Preliminary Amendment 
filed April 21, 2005 (Item 4, attached). 

24. The pending claims of the instant application are patentable over Kasif Kasif — 
alone or in combination with any other reference in the Search Report — does not 
disclose, teach, or suggest any of the pending claims. 

25. Kasif discloses a computational formalization of the optimal synthesis strategy for 
oligonucleotide arrays (see for example, Kasif page 6, first sentence of 
Conclusions). Kasif is directed to formalizing a synthesis problem for making 
oligonucleotide arrays, determining precise bounds of the complexity of such a 
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synthesis, and devising computational solutions (see for example, Kasif at page 1, 
Abstract). The disclosure focuses on development of an optimal base addition 
strategy for making an oligo array (see for example, Kasif at page 1, last 
sentence). 

26. In contrast, claim 1 of the present application recites a method for selecting 
siRNA comprising selecting an siRNA of 1 9-25 nucleoside bases. The method 
comprises selecting a target gene and measuring the functionality of sequences 
19-25 nucleotides in length that are substantially complementary to a stretch of 
nucleotides in a target sequence, wherein the fimctionality is dependent upon non- 
target specific criteria, as recited in claim 1 of the instant application. Among 
other things, Kasif does not disclose measuring functionality of any sequences. 
Accordingly, Kasif does not disclose, teach, or suggest claim 1. Claims 2-5 
depend on claim 1, and thus for at least the reasons that claim 1 is patentable over 
the cited reference, claims 2-5 are patentable over it as well. 

27. Claim 6 of the present application recites a method for developing an siRNA 
algorithm for selecting siRNA comprising, among other things, determining the 
amount of improved functionality of an siRNA by the presence or absence of at 
least one of a number of specified variables. These variables are not disclosed in 
Kasif Accordingly, Kasif does not disclose, teach, or suggest claim 6. Claims 7 
and 8 depend on claim 6, and thus for at least the reasons that claim 6 is 
patentable over the cited reference, claims 7 and 8 are patentable over it as well. 

28. Claim 19 recites a kit comprising at least two siRNA, wherein the at least two 
siRNA comprise a first optimized siRNA and a second optimized siRNA, wherein 
the siRNA are optimized according to recited formulas. Kasif does not disclose, 
teach, or suggest the recited formulas. Accordingly, Kasif does not disclose, 
teach, or suggest claim 19. 
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Amarzguioui et al (2000) Secondary structure prediction and 
in vitro accessibility of mRNA as tools in the selection of 
target sites for ribozymes. Nucleic Acids Research^ 28/2L 4113-4124 

29. The Search Report categorized Amarzguioui et a/, ("Amarzguioui") as an "A" 
reference, specified the entire article, and indicated relevance to claims 1-19 of 
this application. Claims 9-18 have been canceled by Supplemental Preliminary 
Amendment filed April 21, 2005 (Item 4, attached). 

30. The pending claims of the instant application are patentable over Amarzguioui. 
Amarzguioui — alone or in combination with any other reference in the Search 
Report — does not disclose, teach, or suggest any of the pending claims. 

31. Amarzguioui discloses DNA-armed chemically modified hammerhead ribozymes 
targeting eight GUC and GUA sites selected by in vitro accessibility assays and 
MFold prediction (a computational method used here to calculate target sequence 
firee energies; see Amarzguioui at page 4121, bottom third of first column). 
Amarzguioui focuses on target-specific criteria for selecting ribozymes. (see, for 
example, Id. \ Amarzguioui at page 4113, Abstract first sentence; Amarzguioui at 
page 41 13, first paragraph of Introduction; Amarzguioui at page 4117, column 2, 
first full paragraph; and Amarzguioui at page 4118, final paragraph, spanning to 
the top of page 4119). Amarzguioui discloses that antisense 
oligodeoxynucleotides can be used to determine accessibility of stretches in 
mRNA by an in vitro assay that measures RNase H-mediated cleavage at sites of 
hybridization between the antisense oligos and the mRNA (see, for example, 
Amarzguioui, final paragraph of page 4113 through end of paragraph on page 

41 14). Amarzguioui also discloses specific structural features of ribozymes that 
include 8 -H 8 nt arms, a stem II structure 2 basepairs long, deoxynucleotides in 
flanking arms, 2'-0-alkylated residues in the core and stem loop II, a 3' inverted 
thymidine, a 5' hexanol moiety, and short stretches of phosphorothioate linkages 
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at one or both ends (see, for example, first full paragraph on page 4114, first 
column, through end of paragraph in column 2). Amarzguioui investigates 
correlation between predicted structure of target sites and in vivo efficacy of 
corresponding ribozymes, focusing on target site accessibility parameters that 
include local free energy of folding of the target sequence, size of single stranded 
stretches that might function as nucleation sites for duplex formation, and length 
and stability of stems and helices (see for example, Amarzguioui at page 4121, 
middle third of first column). 

32. In contrast, claim 1 of the present application recites a method for selecting 
siRNA comprising selecting an siRNA of 19-25 nucleoside bases, comprising 
selecting a target gene and measuring the functionality of sequences 19-25 
nucleotides in length that are substantially complementary to a stretch of 
nucleotides in a target sequence, wherein the functionality is dependent upon non- 
target specific criteria. Among other things, Amarzguioui does not disclose 
selecting an siRNA using a method that comprises measuring functionality, where 
functionality depends on non-target specific criteria. Accordingly, Amarzguioui 
does not disclose, teach, or suggest claim 1. Claims 2-5 depend on claim 1, and 
thus for at least the reasons that claim 1 is patentable over the cited reference, 
claims 2-5 are patentable over it as well. 

33. Claim 6 of the present application recites a method for developing an siRNA 
algorithm for selecting siRNA comprising, among other things, determining the 
amount of improved functionality of an siRNA by the presence or absence of at 
least one of a number of specified variables. These variables are not disclosed in 
Amarzguioui. Accordingly, Amarzguioui does not disclose, teach, or suggest 
claim 6. Claims 7 and 8 depend on claim 6, and thus for at least the reasons that 
claim 6 is patentable over the cited reference, claims 7 and 8 are patentable over it 
as well. 
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34. Claim 19 recites a kit comprising at least two siRNA, wherein the at least two 
siRNA comprise a first optimized siRNA and a second optimized siRNA, wherein 
the siRNA are optimized according to recited formulas. Amarzguioui does not 
disclose, teach, or suggest the recited formulas. Accordingly, Amarzguioui does 
not disclose, teach, or suggest claim 19. 



35. Applicants submit that all requirements of 37 C.F.R. 1.102(d), 37 C.F.R. 1.17(h), 
and MPEP 708.02(VIII) have been satisfied. Accordingly, Applicants request 
that this application be granted special status. 



Respectfully submitted. 




Tor Smeland 
Registration No.: 43,131 
Attomey for Applicants 



Kalow & Springut LLP 
Telephone No.: (212)813-1600 
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DOCKET 13499 

Functional and Hyperfunctional siRNA 

5 Cross Refe rence to Related Applicatinns 

This application claims the benefit of the filing date of U.S. Provisional 
Application Serial No. 60/426,137, filed November 14, 2002, entitled "Combinatorial 
Pooling Approach for siRNA Induced Gene Silencing and Methods for Selecting 
SiRNA," and U.S. Provisional Application Serial No. 60/502,050, filed September 10, 
10 2003, entitled "Methods for Selecting siRNA," the entire disclosures of which are 
hereby incorporated by reference into the present disclosure. 

Reference to Tahl es Submitted in Electronic Form 

Applicants hereby incorporate by reference the material submitted herewith, in 
1 5 duplicate on the compact disks labeled DOCKET 1 3499, PATENT APPLICATION, 
DISK 1 of 1, COPY 1 of 2; and DOCKET 13499, PATENT APPLICATION, DISK 1 
of 1, COPY 2 of 2; which copies are identical, in files entitled Table_12.txt, date of 
creation June 26, 2003, with a size of 31,045 kb; Table_13.txt, date of creation 
November 13, 2003, with a size of 78,451 kb; Table_14.txt, date of creation 
20 November 13, 2003, with a size of 454 kb; and Table_15.txt date of creation 
November 13, 2003, widi a size of 1,690 kb. 

Field of Invention 

The present invaition relates to RNA interference ("RMAi"). 

25 

Background of the Invention 

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") could be used to inhibit protein expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
30 commercial entities are currently investing considerable resources in developing 
therapies based on this technology. 

Double stranded RNA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refers to RNA guided DNA or 

I 



histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced 
transcriptional attenuation. 

It is generally considered that the majormechanism of RNA induced silencing 
5 (RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
attempts to use RNAi in mammalian cells focused on the use of long strands of 
dsRNA. However, these attempts to induce RNAi met with limited success, due in 
part to the induction of the interferon response, which results in a general, as opposed 
to a target-specific, inhibition of protein syndiesis. Thus, long dsRNA is not a viable 
1 0 option for RNAi in mammalian systems. 

More recently it has been shown that when short (18-30 bp) RNA duplexes are 
introduced into mammalian cells in culture, sequence-specific inhibition of target 
mRNA can be realized without inducing an interferon response. Certain of these 

1 5 short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically 
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the 
cell. A description of the mechanisms for siRNA activity, as well as some of its 
applications are described in Provost et ai, Ribonuclease Activity and RNA Binding of 
Recombinant Human Dicer. E.M.B.O. J., 2002 Nov. 1; 21(21): 5864 -5874; Tabara et 

20 al. The dsRNA Binding Protein RDE-4 Interacts with RDE-1. DCR-1 and a DexH- 
boxHelicase to Direct RNAi in C. elegans. CeU 2002, June 28;109(7):861-71; Ketting 
et al.. Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in 
Developmental Timing in C elegans; Martinez et al. Single-Stranded Antisense 
SiRNAs Guide Target RNA Cleavage in RNAi, Cell 2002, Sept. 6; 1 10(5):563; 

25 Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex. 
Science 2002, 297:2056. 

From a mechanistic perspective, inti-oduction of long double stranded RNA 
into plants and invertebrate cells is broken down into siRNA by a Type III 
30 endonuclease known as Dicer. Sharp, RNA interference— 2001, Genes Dev. 2001, 
15:485. Dicer, a ribonuclease-III-like aizyme, processes tiie dsRNA into 19-23 base 
pair short interfering RNAs witii characteristic two base 3 ' ovwhangs. Bernstein, 
Caudy, Hammond, & Hannon, Role for a bidentate ribonuclease in the initiation step 
of RNA interference. Natiire 2001, 409:363. The siRNAs are then incorporated into 
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an RNA-induced silencing complex (RISC) where one or more helicases unwind the 
siRNA duplex, enabling the complementary antisense strand to guide target 
recognition. Nykanen, Haley, & Zamore, ATP requirements and small interfering 
RNA structure in theRNA interference pathway. Cell 2001, 107:309. Upon binding to 
5 the appropriate target mRNA, one or more endonucleases within the RISC cleaves the 
target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is 
mediated by 21- and 22-nucleotide RNAs, Genes Dev 2001, 15:188, Figure 1. 

The interference effect can be long lasting and may be detectable after many 
10 cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielpw, M. et al. 
(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using 
small interfering RNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can 
specifically knock down one type of transcript, while not affecting closely related 
mKNA. These properties make siRNA a potentially valuable tool for inhibiting gene 
1 5 expression and studying gene function and drug target validation. Moreover, siRNAs 
are potentially useful as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

20 Successful siRNA-dependoit gene sileaicing depends on a number of factors. 

One of the most contaitious issues in RNAi is the question of the necessity of siRNA 
design, i.e., considering the sequence of the siRNA used. Early work in C. elegans 
and plants circumvented the issue of design by introducing long dsRNA (see, for 
instance. Fire, A. et al (1998) Nature 391 :806-81 1). In this primitive organism, long 

25 dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse 

population of duplexes that can potentially cover the entire transcript. While some 
fi-action of these molecules are non-functional {i.e. induce little or no silencing) one or 
more have, the potential to be highly functional, thereby silencing the gene of interest 
and alleviating the need for siRNA design. Unfortunately, due to the interferon 

30 response, this same approach is unavailable for manmialian systems. While this effect 
can be circumvented by bypassing the Dicw cleavage step and directly introducing 
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non- 
functional or semi-fimctional. 
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A number of researches have expressed the view that siRNA design is not a 
crucial element of RNAi. On the other hand, others in the field have begun to explore 
the possibility that RNAi can be made more efficient by paying attention to the design 
of the siRNA. Unfortunately, none of the rq)orted methods have provided a 
5 satisfactory scheme for reliably selecting siRNA with acceptable levels of 

functionality. Accordingly, there is a need to develop rational criteria by which to 
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of functionality, as well as to identify siRNAs that are 
hyperfunctional. 

10 

Sammiary of the Invention 

The present invention is directed to increasing the efficiency of RNAi, 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRNA efficacy. 

15 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger RNA. 

20 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or more siRNAs that are optimized 
according to a formula (or algorithm) selected fi-om: 
Fonnula I 

25 Relative functionality of siRNA= -(GC/3) +(AU,5.i9) -(Tm2o<'c)*3 -<G,3)*3 -<Ci9) 
+(Ai9)*2+(A3) +(U,o)+(Ai4)^5)"(Au) 

Formula II 

Relative fimctionality of siRNA= -(GC/3) -(AUi5-i9)*3 "(G,3)*3 -(C19) +(Ai9)*2 
30 +(A3) 

Formula III 

Relative functionality of siRNA- -(GC/3) +(AUi5.i9) -(Tm2ox)*3 
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Formula IV 

Relative fiinctionality of siRNA= 

-GC/2+( AU,s-i9)/2-(Tm2o^)*2 -(G,3)*3 -(C19) +(A,9)*2 +(A3) "KUioHAu) -(Uj) - 
(An) 

5 

Formula V 

Relative functionality of siRNA=-(G,3)*3 -(€,9) ■KAi9)*2 +(A3) + (Uio)+(A,4) -(U5) 
-<An) 

10 Formula VI 

Relative functionality of siRNA=-(Gi3)*3 -<Ci9) +(Ai9)*2 +(A3) 

Formula VII 

Relative functionality of siRNA=-(GC/2) +(AU,5.i9)/2 -( Tm2o«c)*l -(G,3)*3 -(Ci?) 
15 +(Ai9)*3 +(A3)*3 +(U,o)/2+(A,4)/2 -(U5)/2 -(Ah)/2 



vi^hCTein in Formulas I - VII: 

Tm 2o°c=l if the Tm is greater than 20°C; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its value 

isO; 

AU15.19 = 0-5 depending on the number of A or U bases on the sense strand 
positions 15-19; 

G|3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 

0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

0; 

GC= the number of G and C bases in the entire sense strand; 

A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 

Aii= 1 if A is the base at position 1 1 on the sense strand, otherwise its value is 

0; 

Ai4= 1 if A is the base at position 14 on the sense strand, otherwise its value is 

0; 
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Uio- 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

U5= 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; 

or 

Formula VIII Relative functionality of siRNA = 

(-14)*G,3-13*A,-12*U7-ll*U2-10*An-10*U4-10*C3-10*C5-10*C6- 
9*A,o-9*U9-9*C,8-8*Gio-7*U,-7*U,6-7*C,7-7*C,9 
+7*U,7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +10*A7+10*U,8+1 1 *A,9+ 

ll*C9+15*G,+ 18*A3+19*U,o-Tm-3*(GC.„,ai)-6*(GC,5_,9)- 
30*X;and , 



Formula DC Relative functionality of siRNA = 

(14.1)*A3+(14.9)*A6+(17.6)*A,3+(24.7)*A,9+(14.2)*U,o+(10.5)* 
*5 -■C9+(23.9)*G,-K16.3)*G2+(-12.3)*A„+(-19.3)*U,+(-12.1)*U2+ 
(-1 l)*U3+(-15.2)*U,s+(-l 1.3)*U,6+ (-1 1 .8)*C3+(-17.4)*C6+(- 
10.5)*C7+ (-13.7)*G,3+(-25.9)*G,9-Tm-3* (GCto,ai) - 6*(GC,5-,9)- 
30*X 

wherein 

Ai = 1 if A is the base at position 1 of the sense strand,^tiierwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is die base at position 3 of die sense strand, otherwise its value is 0; 
A4 ^ 1 if A is the base at position 4 of die sense strand, otherwise its value is 0; 
A5 = 1 if A is the base at position 5 of die sense strand, otherwise its value is 0; 
25 A^ = 1 if A is tiie base at position 6 of die sense strand, otherwise its value is 0; 
A7 = 1 if A is die base at position 7 of the sense strand, otiierwise its value is 0; 
Aio= 1 if A is die base at position 10 of die sense strand, odierwise its value is 0; 
An = 1 if A is die base at position 11 of die sense strand, odierwise its value is 0; 
Ai3 = 1 if A is die base at position 13 of die sense sh-and, odierwise its value is 0; 
30 Ai9 = 1 if A is die base at position 1 9 of die sense strand, odierwise if anodiar base is 
present or die sense sti^d is only 1 8 base pairs in lengdi, its value is 0; 

C3 = 1 if C is die base at position 3 of die sense strand, odierwise its value is 0; 
C4 = 1 if C is the base at position 4 of die sense stiiand, otherwise its value is 0; 
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Cs = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
5 Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

10 Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 

G2= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 

G8= 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 

Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
15 Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 

U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
20 U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 

U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 

U9 = i if U. is. the base at position 9 on the sense strand, otherwise its value is 0; 

Uio = 1 if U is ttie base at position 10 on the sense strand, otherwise its value is 0; 
25 Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 

U16 == 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

U18 = 1 if U is the base at position 1 8 on the sense strand, otherwise its value is 0; 

30 GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 15 -1 8 if the sense strand is only 18 base pairs in length; 

GCtotai = the number of G and C bases in the sense strand; 

Tm= 100 if the targeting site contains an inverted repeat longer than 4 base pairs, 
otherwise its value is 0; and 
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X - the number of times that the same nucleotide repeats four or more times in a row. 

According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
to one of the formulas above. Preferably the kit contains at least two optimized 
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 

The abiUty to use the above algorithms, which are not sequence or species 
specific, allows for the cost-effective selection of optimized siRNAs for specific 
target sequences. Accordingly, there will be both greater efficiency and reliability in 
the use of siRNA technologies. 

According to a fourth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting functional and hyperfunctional 
siRNAs for a given sequence. The method comprises: 

(a) selecting a set of siRNAs; 

(b) measuring the gene silencing ability of each siRNApfi-om said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
,i absmice of at least one variable selected from the group consisting of 

the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at a 
particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 

According to this embodiment, preferably the set of siRNAs comprises at least 
90 siRNAs firom at least one gene, more preferably at least 1 80 siRNAs fi-om at least 
two different genes, and most preferably at least 270 and 360 siRNAs firom at least 
three and four different genes, respectively. Additionally, in step (d) the 
determination is made with preferably at least two, more preferably at least three, 
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even more preferably at least four, and most preferably all of the variables. The 
resulting algoriflun is not target sequence specific. 

In a fifth embodiment, the present invention provides rationally designed 
5 siRNAs identified using the formulas above. 

In a sixth embodiment, the present invention is directed to hyperfiinctional 

siRNA. 

10 For a better understanding of the present invention together with other and 

fiarther advantages and embodiments, reference is made to the following description 
taken in conjunction with the examples, the scope of which is set forth in the 
appended claims. 

15 Brief Description of the Figures 

Figure 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding, the duplex is 
unwound, and the relevant target is identified, cleaved, and released. 

20 Figure 2 is a representation of the fiinctionality of two hundred and seventy siRNA 
duplexes that were generated to target human cyclophilin, human diazepam-binding 
inhibitor (DB), and firefly ludferase. 

Figure 3a is a representation of the silencing effect of 30 siRNAs in three different 
25 cells lines, HEK293, DU145, and Hela. Figure 3b shows the fi^uency of different 
functional groups (>95% silencing (black), >80% silencing (gray), >50% silencing 
(daric gray), and <50% silencing (white)) based on GC content. In cases where a given 
bar is absent fi-om a particular GC percentage, no siRNA were identified for that 
particular group. Figure 3c shows the frequency of different functional groups based 
30 on melting temperature (Tm). Again, each group has four different divisions: >95% 
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing. 

Figure 4 is a representation of a statistical analysis that revealed correlations between 
silencing and five sequence-related properties of siRNA: (A) an A at position 19 of 
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the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of 
the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a 
base other than C at position 19 of the sense strand. All variables were correlated with 
siRNA silencing of firefly ludferase and hianan cyclophilin. SiRNAs satisfying the 
5 criterion are grouped on the left (Selected) while those that do not, are grouped on the 
right (EUminated). Y-axis is "% Silencing of Control." Each position on the X-axis 
represents a unique siRNA. 

Figures 5 A and 5 B are representations of firefly luciferase and cyclophilin siRNA 
1 0 panels sorted according to fimctionality and predicted values using Formula VIII. The 
siRNA found within the circle represent those that have Formula Vni values 
(SMARTscores™) above zero. SiRNA outside the indicated area have calculated 
Formula VIII values that are below zero. Y-axis is "Expression (% Control)." Each 
position on the X-axis represents a unique siRNA. 

15 

Figure 6A is a representation of the average internal stability profile (AISP) derived 
fi-om 270 siRNAs taken from three separate genes (cyclophilin B, DBI and firefly 
luciferase). Graphs represent AISP values of highly fiinctional, fijnctional, and non- 
fiinctional siRNA. Figure 6B is a comparison between the AISP of naturally daived 
20 GFP siRNA (filled squares) and the AISP of siRNA firom cyclophilin B^J^BI, and 
luciferase having >90% silencing propaties (no fill) for the antisense strand. "DO" is 
the symbol for AG, fi-ee enwgy. 

Figure 7 is a histogram showing the differences in duplex fimctionality upon 
25 introduction of basepair mismatches. The X-axis shows the mismatch introduced inot 
the siRNA and the position it is intoroduced (e.g., 8C->A reveals that position 8 (which 
normally has a C) has been changed to an A). The Y-axis is "% Silencing 
(Normalized to Control)." 

30 Figure 8a is histogram that shows the effects of 5'saise and antisense strand 
modification with 2'-0-methylation on fimctionality. Figure 8b is an expression 
profile showing a comparison of sense sti-and off-target effects for IGFlR-3 and 2'-0- 
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methyl IGFlR-3. Sense strand off-targets (lower white box) are not induced when the 
5' end of the sense strand is modified with 2'-0-methyl groups (top white box). 



Figure 9 shows a graph of SMARTscores™ versus RNAi silencing values for more 
5 than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
bar represent those siRNA that have desirable SMARTscores™. 

Figures lOA - E compare the RNAi of five different genes (SEAP, DBl, PLK, 
Firefly Luciferase, and Renila Luciferase) by varying numbers of randomly selected 

1 0 siRNA and four rationally designed (SMART-selected) siRNA chosai using the 

algorithm desaibed in Formula VIII. In addition, RNAi induced by a pool of the four 
SMART-selected siRNA is reported at two different concentrations (100 and 400nM). 
IDF is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and 
a pool of SMART selected EGFR siRNA (Pool 2). Pool 1, SI— S4 and Pool 2 SI— 

15 S4 represent the individual members that made up each respective pool. Note that 

numbers for random siRNAs represent the position of the 5' end of the sense strand of 
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is 
the percent expression of the control. 

20 Figure 1 1 shows the Western blot results from cells treated with siRNA directed 
against twelve different genes involved in the clathrin-dependent endocytosis 
pathway (CHC, Dynll, CALM, CLCa, CLCb, Epsl5, EpslSR, RabSa, RabSb, RabSc, 
p2 subunit of AP-2 and EBArl). SiRNA were selected using Formula VHI. "Pool" 
represents a mixture of diq>lexes 1-4. Total concentration of each siRNA in flie pool 

25 is 25 nM. Total concoitration = 4 x 25 = 1 00 nM. 

Figure 12 is a representation of the gene silencing capabilities of rationally-selected 
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, 
human lamin A/C, QB (ubiquinol-cytochrome c reductase core protein I), MEKl and 
30 MEK2, ATEl (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is 
the percent expression of the control. Numbers 1, 2, 3 and 4 represent individual 
rationally selected siRNA. "Pool" represaits a mixture of the four individual siRNA. 
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Figure 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII. 
Sequaices are listed 5' to 3'. 



Figure 14 is the knockdown by the top ten Bcl2 siRNAs at lOOnM concentrations. 
5 The Y-axis represents the amount of expression relative to the non-specific (ns) and 
transfection mixture control. 

Figure 15 represents a functional walk where siRNA beginning on every other base 
pair of a region of the luciferase gene are tested for the ability to silence the luciferase 
10 gene. The Y-axis represents the percent ejqpression relative to a control. The X-axis 
represents the position of each individual siRNA. 

Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
pools of 2 and 3 siRNAs duplexes taken from the walk described in Figure 15. The 
1 5 Y-axis represents the percent expression relative to control. The X-axis represents the 
position of die first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so on. 

20 Figure 17 is a histogram demonstrating the inhibition of target gene expression,by 
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression 
relative to a control. The X-axis r^esents the positoin of the first siRNA in each 
pool. 

25 Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a control. The X-axis represents the position of the first siRNA 
in each pool. 

30 Figure 19 shows that pools of siRNAs (dark gray bar) work as well (or better) than 
the best siRNA in the pool (light gray bar). The Y-axis represents the percent 
expression relative to a control. The-X axis represents the position of the first siRNA 
in each pool. 



12 



Figure 20 shows that the combination of several semifunctional siRNAs (dark gray) 
result in a significant improvement of gene expression inhibition over individual 
(semi-functional; light gray) siRNA. The Y-axis represents tiie percent expression 
relative to a control. 

5 

Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of 
gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline 
phosphatase). Numbers on the X-axis indicate the position of the 5'-most nucleotide 
of the sense strand of the duplex. The Y-axis represents the percent expression of 

10 each gene relative to a control. Libraries contain siRNAs that begin at the following 
nucleotides: Seap: Lib 1: 206, 766, 812,923, Lib 2: 11 17, 1280, 1300, 1487, Lib 3: 
206, 766, 812, 923, 1117, 1280, 1300,1487v Lib 4: 206, 812, 1117, 1300, Lib 5: 766, 
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1 : 979. 1339, 2029, 2590, Lib 2: 
1087,1783,2399,3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 

15 2029, 2399,2590,3257, Lib 5: 979, 1087. 1339, 1783, Lib 6: 2029,2399,2590,3257- 
ReniUa:Libl: 174,300,432,568, Lib 2: 592, 633, 729,867, Lib 3: 174,300,432, 
568, 592, 633,729,867, Lib 4: 174,432,592,729, Lib 5: 300,568,633,867, Lib 6: 
592,568. 

€0 

Figure 22 showS the results of an EGFR and TfiiR internalization assay wh«ti single 
gene knockdowns are performed. The Y-axis represents percent internalization 
relative to control. v . ^.^ 

25 Figure 23 shows the results of an EGFR and TfiiR internalization assay when 
multiple genes are knocked down {e.g. Rab5a, b, c). The Y-axis represents the 
percent internalization relative to control. 

Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs 
30 directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into 
cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target 
levels for each of the four genes. A comparison is made between cells transfected 
with individual siRNAs vs. a pool of siRNAs durected against all four genes. 
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Figure 25 shows the functionality of ten siRNAs at O.SnM concentrations. 

Detailed Description 
Defimitioiis 

5 Unless stated otherwise, the following terms and phrases have the meanings 

provided below: 

siRNA 

The term "siRNA" refers to small inhibitory RNA duplexes that induce the 
1 0 RNA interference (RNAi) pathway. These molecules can vary in length (generally 
between 18-30 basq)airS) and contain varying degrees of complementarity to their 
target mRNA in the antisense strand: Some, but not all, siRNA have unpaired 
overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. 
The term "siRNA" includes duplexes of two separate strands, as well as single strands 
1 5 that can form hairpin structures comprising a duplex region. 

SiRNA may be divided into five (5) groups (non-functional, semi-functional, 
functional, highly functional, and hyper-functional) based on the level or degree of 
silencing that they induce in cultured cell lines. As used herein, these definitions are 

20 v based on a set of conditions where the siRNA is transfected into said cell line at a 
concentration of lOOnM and the level of silencing is tested at a time of roughly 24 
hours after transfection, and not exceeding 72 hows after transfection. In this context, 
"non-functional siRNA" are defined as those siRNA that induce less than 50% 
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing. 

25 "Functional siRNA" are molecules that induce 80-95% gene silencing. "Highly- 
functional siRNA" are molecules that induce greater than 95% gene silencing. 
"Hyperfunctional siRNA" are a special class of molecules. For purposes of this 
document, hyperfunctional siRNA are defined as those molecules that: (1) induce 
greater than 95% silencing of a specific target when they are transfected at 

30 subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce 
functional (or better) levels of silencing for greater than 96 hours. These relative 
functionalities (though not intended to be absolutes) may be used to compare siRNAs 
to a particular target for applications such as functional genomics, target identification 
and therapeutics. 
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miRNA 

The term "miRNA" refers to microRNA. 

5 Gene silencing 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 
variety of pathways. Unless specified otherwise, as used herin, gene silencing refers 
to decreases in gene product expression that results from RNA interference (RNAi), a 

1 0 defined, though partially characterized pathway whereby small inhibitory RNA 

(siRNA) act in concert with host proteins {e.g. the RNA induced silencing complex, 
RISC) to degrade messengerlS^A (mRNA) in a sequence-dependent fashion. The 
level of gene silencing can be measured by a variety of means, including, but not 
limited to, measur«nent of transcript levels by Northern Blot Analysis, B-DNA 

15 techniques, transcription-sensitive reporter constructs, expression profiling (e.g. DNA 
chips), and related technologies. Alternatively, the level of silencing can be measured 
by assessing the level of the protein encoded by a specific gene. This can be 
accomplished by perfonning a number of studies including Westem Analysis, 
measuring the levels of expression of a reporter protein that has €,g. fluorescent 

20 .properties (e.g. GFP) or enzymatic activity (e,g. alkaline phosphatases), or several 
other procedures. 

Transfection ..^ v^t^^ 

The term "transfection" refers to a process by which agents are introduced into 

25 a cell. The list of agents that can be transfected is large and includes, but is not 
limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
genes and organized into an expression plasmid, proteins, protein fragments, and 
more. There are multiple methods for transfecting agents into a cell including, but not 
limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran- 

30 based transfections, lipid-based transfections, molecular conjugate-based transfections 
(e.g, polylysine-DNA conjugates), microinjection and others. 
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Target 

The term "target" is used in a variety of different forms throughout this 
document and is defined by the context in which it is used. "Target mRNA" refers to 
a messenger RNA to which a given siRNA can be directed against "Target sequence" 
and "target site" refer to a sequence within the mRNA to which the sense strand of an 
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 

Off-target silencing and OfF-target interference 

The phrases "offirtarget silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target mRNA due to overlapping and/or 
partial homology with secondary mRNA messages. 

SMARTscore™ 

The term "SMARTscore™" refers to a number determined by applying any of 
the Formulas I - Formula IX to a given siRNA sequence. The term "SMART- 
selected" or "rationally selected" or "rational selection" refers to siRNA that have 
been selected on the basis of their SMARTscores™. 

Complementary 

The term "complementary" refers to^the ability of polynucleotides to form 
base pairs with one another. Base pairs are typically formed by hydrogen bonds 
between nucleotide units in antiparallel polynucleotide strands. Complementary 
polynucleotide strands can base pair in the Watson-Crick manner (e.g., A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, when using RNA as opposed to DNA, uracil 
rather than thymine is the base that is considered to be complementary to adenosine. 
However; when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 

Perfect complementarity or 100% complementarity refers to the situation in 
which each nucleotide unit of one polynucleotide strand can hydrogen bond with a 
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nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
refers to the situation in which some, but not all, nucleotide units of two strands can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 
10% complementarity. In the same example, if 18 base pairs on each strand can 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. ''Substantial complementarity'' refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 
("Substantial similarity'' refers to polynucleotide strands exhibiting 79% or greater 
similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thus^for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3' terminus such that the 
duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 
each strand are complementary, are substantially complementary since they are 96.3% 
complementary when excluding the di-dT overhangs. 

Deoxvnucleotide 

The term "deoxynucleotide" refers to a nucleotide or polynucleotide lacking a 
hydroxy! group (OH group) at the 2' and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the 2' and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at the 2' position of the sugar moiety^Aaadsfrinstead a hydrogen bonded 
directly to the 2' carbon. 

Deoxvribonucleotide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
at its 2' and/or 3 'position. 

Substantially Similar 

The phrase "substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 
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Duplex Region 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
5 duplex between polynucleotide strands that are complementary or substantially 

complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 1 9 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 19 base pairs. The remaining bases may, for example, exist as 5' and 3' 
1 0 overhangs. Further, within the duplex region, 1 00% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
rendering the duplex region substantially complementary. 

15 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides include species that 
comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and 
20 analogs, as .well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs. 

Nucleotide analogs include nucleotides ha\^g-m©difications in the chOTiical 
structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 

25 pyrimidine modifications, 8-position purine modifications, modifications at cytosine 
exocyclic amines, and substitution of 5-bromo-uracil; and 2'-position sugar 
modifications, including but not Umited to, sugar-modified ribonucleotides in which 
the 2'-OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, 
NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 

30 include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2'-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphorothioates and peptides. 
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Modified bases refer to nucleotide bases such as, for example, adenine, 
guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
examples of types of modifications that can comprise nucleotides that are modified 
5 with respect to the base moieties include but are not limited to, alkylated, halogenated, 
thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyIadenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2- 
propylguanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 

1 0 5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2- 
amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1- 
methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 2- 
methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5- 
methylaminoiethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza- 

15 adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other 
thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5- 
methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2- 

20 one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
caiboxyhydroxyalkyl nucleotides, caiboxyalkylaminoaUcjia^Bideotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 

25 nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties 
maybe, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4'- 
thioribose, and other sugars, heterocycles, or carbocycles. 

30 The tenn nucleotide is also meant to include what are known in the art as 

universal bases. By way of example, universal bases include but are not limited to 3- 
nitropyrrole, 5-nitroindole, or nebularine. The term "nucleotide" is also meant to 
include the N3' to P5' phosphoramidate, resulting firom the substitution of a ribosyl 3' 
oxygen with an amine group. 
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Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 

Polvnucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties 
(/,e., wherein alternate nucleotide units have an -OH, then and -H, then an -OH, then 
an -H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide imits at any position are included. 

Polyribonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide/' 

Ribonucleotide and ribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide unit comprises an hjrfises^li^oup attached to 
the 2' position of a ribosyl moiety that has a nitrogenous base attached in N- 
glycosidic linkage at the T position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 

Detailed Description of the Invention 

The present invention is directed to improving the efficiency of gene silencing 
by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may be achieved. 
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The present invention will now be described in connection with preferred 
embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All alternatives, modifications and equivalents tiiat may 
5 become apparent to those of ordinary skill upon reading this disclosure are included 
within the spirit and scope of the present invention. 

Furthermore, this disclosure is not a primer on RNA interference, Basic 
concepts known to persons skilled in the art have not been set forth in detail. 

10 

Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
1 5 method may be used individually, or in conjunction with the first embodiment, Le., 
with one or more other siRNAs, each of which may or may not be selected by this 
criteria in order to maximize their efficiency. 

The degree to which it is possible to select an siRNA for a given mRNA that 
20 maximizes these criteriawill depend on the sequence of the mRNA itself. However, 
the selection criteria will be independent of the target sequence. According to this 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational>:desig&is,.in simplest 
terms, the application of a proven set of critOTa fliat enhance the probability of 
25 identifying a functional or hyperfiinctional siRNA. Li one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 



1 . A low GC content, preferably between about 30 -52%. 
30 2. At least 2, preferably at least 3 A or U bases at positions 1 5- 1 9 of the 

siRNA on the sense strand. 

3 . An A base at position 1 9 of the sense strand. 

4. An A base at position 3 of the sense strand. 

5. A U base at position 1 0 of the sense strand. 
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6. An A base at position 14 of the sense strand. 

7. A base other than C at position 19 of the sense strand. 

8. A base other than G at position 1 3 of the sense strand. 

9. A Tm, which refers to the character of the internal repeat that results in 
inter- or intramolecular structures for one strand of the duplex, that is 
preferably not stable at greater than 50^C, more preferably not stable at 
greater than even more preferably not stable at greater than 30°C 
and most preferably not stable at greater than 20^C. 

10. A base other than U at position 5 of the sense strand. 

11. A base other than A at position 1 1 of the sense strand. 



Criteria 5, 6, 10 and 1 1 are minor criteria, bubare^rronetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1 - 4 and 7-9, and most preferably all of the 
15 oiteria 



With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex unwinding has been shown to be crucial for siRNA 
20 fiinctionality m viva. 

With respect to aiterion 9, the internal structure is measured in terais of the 
meltmg temperature of the single strand of siRNA, which is the te^paealufe at which 
50% of the molecules will become denatured, Wifli respect to criteria 2 - 8 and 10 - 
25 1 1 , the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still anottier preferred 
30 embodiment, at least criteria 1, 8 and 9 are satisfied. 

It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect to the 5' end of tiie sense strand; Reference is 
made to the sense strand, because most databases contain information that describes 



22 



the infonnation of the mRNA. Because according to the present invention a chain can 
be from 18 to 30 bases in lengA, and the aforementioned criteria assumes a chain 19 
base pairs in length, it is important to keep the aforementioned criteria applicable to 
the correct bases. 

5 

When there are only 1 8 bases, the base pair that is not present is the base pair 
that is located at the 3' of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5' end of the sense chain and occupy 
positions ~1 to "1 1 . Accordingly, with respect to SEQ. ID NO. 0001 . 
10 NNANANNNNUCNAANNNNAandSEQ.IDNO.0028. 

GUCNNANANNNNUCNAANNNNA, both would have A at position 3, A at 
position 5, U at position 10, C at position 1 1, A and*p6sition 13, A and position 14 
and A at position 19. Howevor, SEQ. ID NO. 0028 would also have C at position -1, 
U at position -2 and G at position -3. 

15 

For a 19 base pair siRNA, an optimal sequence of one of the strands may be 
represented below, where N is any base, A, C, G, or U: 

SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA 
20 SEQ. ID NO. 0002. NNANAMNNNUGNAANNNNA 
SEQ. ID NO. 0003. NNANANNNNLTUNAANNNNA 
SEQ. ID NO. 0004. NNANANNNNUCNCANNNNA 

SEQ. ID NO. 0005. NNANANNNNUCNCANNNNA «-.«a***^ 

SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA 
25 SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA 

SEQ. ID NO. 0008.. NNANANNNNUCNUANNNNA 

SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA 

SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 001 1 . NNANCNNNNUCNAANNNNA 
30 SEQ. IDNO. 0012.NNANCNNNNUUNAANNNNA 

SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0014. NNANCNNNNUGNCANNNNA 

SEQ. ID NO. 0015. NNANCNNNNUUNCANNNNA 

SEQ. ID NO. 0016. NANCNNNNUCNUANNNNA 
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SEQ. ID NO. 0017. NNANCNNNNUGNUANNNNA 
SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA 
SEQ. ID NO. 0019. NNANGNNNNUCNAANNNNA 
SEQ. ID NO. 0020. NNANGNNNNUCNAANNNNA 
5 SEQ. ID NO. 0021 . NNANGNNNNUUNAANNNNA 
SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA 
SEQ. ID NO. 0023. NNANGNNNNUCNCANNNNA 
SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA 
SEQ. ID NO. 0025. NNANCNNNNUGNUANNNNA 
10 SEQ. ID NO. 0026. NNANCNNNNUGNUANNNNA 
SEQ. ID NO. 0027. NNANGNNNNNUNUANNNNA 



In one embodiment, the sequence used as an siRNA is selected by choosing 
the siRNA that score highest according to one of the foUowing seven algorithms that 
15 . are represented by Formulas I - VII: 

Formula I 

Relative fiinctionality of siRNA= -(GC/3) +(AUi5-i9) -(Tm2o»c)*3 -(C,j)*3 -{C.s) 
+(A,9)*2+(A3) +(U,o)+(A,4)-(U5)-(Aii) 
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Formula 11 



Relative functionality of siRNA= -(GC/3) -<AU,5.i9)*3 -{G,3)*3 -(Cg) +(A,9)*2 



25 Formula III 

Relative functionality of siRNA= -(GC/3) -KAUis-^) -(Tm2o»c)*3 

Formula IV 

Relative functionality of siRNA= 

30 -CC/2+( AU,s-,9)/2-(Tm2o.c)*2 -(G,3)*3 -(C.j) +(A,9)*2 -KAj) +(U,o)+(A,4) -(Us) ■ 
(An) 



Formula V 
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Relative functionality of siRNA=-(G,3)*3 -(Cis) +(A,9)*2 +(A3) + (Uio)+(Ai4) -{U5) 
-(A„) 



Formula VI 

5 Relative fimctionality of siRNA=-(G,3)*3 -(C19) +(Ai9)*2 +(A3) 
Formula VII 

Relative fimctionality of siRNA=-(GC/2) +(AU,5-,9)/2 -( Tm2o«c)*l -{Gn)*3 -(0,9) 
+(Ai9)*3 +(A3)*3 +(U,o)/2+(Ai4)/2 -<U5)/2 -(Aii)/2 

10 

In Formulas I - VII: 

wherein A19 = 1 if A is the base at position -19 on the sense sti-and, othesrwise its value 
is 0, 

AU15.19 = 0-5 depending on tiie numba- of A or U bases on the sense strand 



15 at 
0; 

20 0; 



positions 15-19; 

Gi3 =1 if G is the base at position 13 on the sense strand, otherwise its value is 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

GC= the number of G and C bases in the entire sense strand; 
Tm 20'»cf=l if the Tm is greater than 20''C; 

A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 
Uio= 1 if U is the base at position 10 on the sense strand, o&erwise its value is 

A 14 = 1 if A is the base at position 14 on the sense strand, otherwise its value 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its value is 

0; and 

30 All = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 

0, 

Formulas I -VII provide relative information regarding functionality. When 
the values for two sequences are compared for a given formula, the relative 

25 



25 0; 



is 0: 



functionality is ascertained; a higher positive number indicates a greater fimctionality. 
For example, in many applications a value of 5 or greater is beneficial. 

Additionally, in many applications, more than one of these fonnulas would 
5 provide useftil information as to the relative functionality of potential siRNA 

sequences. However, it is beneficial to have more than one type of formula, because 
not every formula will be able to help to differentiate among potential siRNA 
sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and application of fonnulas that lack GC 

1 0 elements {e.g.. formulas V and VI) might provide greater insights into duplex 
fimctionality. Similarly, formula II might by used in situations where hairpin 
structures are not observed in diqjlexesj and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 
■■ 15 differaitiation among sequences. In some instances, application of a given algorithim 
may identify an unususally large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Altematively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequraices (t.e. low 

20 SMARTscores™). In this instance, it may be appropriate to re-analyze that sequences 
with a second algorithm that is, for instance, less stringent. In still oflier mstances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
x^.^ . results (i.e. one formula generates a set of siRNA with high SMARTscores™ whil^^. 
the other formula identifies a set of siRNA wiA low SMARTscores"™). In these 

25 instances, it may be necessary to determine which weighted factor(s) (e.g. GC 
content) are contributing to the discrepancy and assessing the sequence to decide 
whether these factors should or should not be included. Altematively, the sequence 
could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally 
designed siRNA. 

30 

The above-referenced criteria are particulariy advantageous when used in 
combination with pooling techniques as depicted in Table I: 
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Table I 



Criteria 


Functional Probability 


Oligos 


Pools 


>95% 


>80% 


<70% 


>95% 


>80% 


<70% 


Current 


33.0 


50.0 


23.0 


79.5 


97.3 


0.3 


New 


50.0 


88.5 


8.0 


93.8 


99.98 


0.005 


(GC) 


28.0 


58.9 


36.0 


72.8 


97.1 


1.6 



The term "current" refers to Tuschl's conventional siRNA parameters (Elbashir, S.M. 
et al. (2002) "Analysis of gene fiinction in somatic mammalian cells using small 
5 interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
described in Formulas I-VII. "GC" rel&S trcriteria that select siRNA solely on the 
basis of GC content. 

As Table I indicates, when more functional siRNA duplexes are chosen, 
10 siRNAs that produce <70% silencing drops from 23% to 8% and the number of 

siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. Further, of 
the SiRNA duplexes with >80% silencing, a larger portion of these siRNAs actually 
silence >95% of the target expression (the new criteria increases the portion from 
33% to 50%). Using this new criteria in pooled SiRNAs, shows that, wifli pooling, the 
15 amount of silencing >95% increases from 79.5% to 93.8% and essentially eliminates 
any.siRNA pool from silencing less than 70%. 

i.Vs,». ■i'- -"-' • 

:> h 

Table II similarly shows the particularly beneficial results of pooling in 
combination wiA the aforementioned criteria. However, Table II, which takes into 
20 account each of the aforementioned variables, demonstrates even a greater degree of 
improvement in functionality. 



Table U 





Functional Probability 


Oligos 


Pools 


Functional 


Average 


Non- 
functional 


Functional 


Average 


Non- 
functional 


Random 


20 


40 


50 


67 


97 


3 
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Criteria 1 


52 


99 


0.1 


97 


93 


0.0040 


Criteria 4 


89 


99 


0.1 


99 


99 


0.0000 



The terms "functional," "Average," and "Non-functional" refer to siRNA that exhibit 
>80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 

5 

The above-described algorithms may be used with or without a computer 
program that allows for the inputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 
from a local terminal or personal computer, over an internal network or over the 
10 Internet. - 
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In addition to the formulas above, more detailed algorithms may be used for 
selecting siRNA. Preferably, at least one RNA duplex of between 18 and 30 base 
pairs is selected such that it is optimized according a formula selected from: 



Fomiula VIII: (-14)*Gi3-13*A,-12*U7-l l*U2-10*An-10*U4-10*C3-10*C5-10*C6- 
9*Aio-9*U9-9*C,8-8*G,o-7*U,-7*U,6-7*C,7-7*C,9 
+7*Ui7+8*A2+8* A, +8*A5+8*C4 +9*G8 +1 0*A7+1 0*Ui8+l 1 *Ai9-l- 
20 1 1*C9+15*G,+ 18*A3+19*U,o-Tm-3* (GQoun) - 6*(GC,5-i9)- 

* 30*X;and 

FomiulalX: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*A,9+(14.2)*U,o+(10.5)* 
C9+(23.9)*Gi+(l 6.3)*G2+(-12.3)*A, i+(-19.3) *Ui+(-l 2. 1)*U2+ 
25 (-1 l)*U3+(-15.2)*U,5+(-l 1 .3)*U,6+ (-1 1.8)*C3+(-17.4)*C6+(- 

10.5)*C7+ (-13.7)*G,3+(-25.9)*Gi!r-Tm-3* (GQ^ai) - 6*(GC,5-i9)- 
30*X 

wherein 

30 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, othwwise its value is 0; 
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A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4= 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
As = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
A6= 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
Ay = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
Ai3= 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
Ai9= 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 



C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

Cs = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
15 C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 

Ci7 = I if C is the base at position 17 of the sense strand, otherwise its value is 0; 

C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
20 Ci9 = 1 if C is the base at position 1 9 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, oth^wise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
25 Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

Gio = 1 if G is the base at position 1 0 on the sense strand, otherwise its value is 0; • 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9 = 1 if G is the base at position 1 9 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

30 

U| = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 =7 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
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1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 1 5 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GCi5-i9 = the number of G and C bases within positions 15 - 19 of the sense 
strand, or within positions 15 -18 if the sense strand is only 1 8 base pairs in 
length; 

GQoui = the number of and C bases in the sense strand; 
Tm = 100 if the siRNA oligo has the internal repeat longer then 4 base pairs, 
otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a 
row. 

The above formulas VIII and EX, as well as formulas I - VII, provide methods 
for selecting siRNA in order to increase the efBciency of gene silencing. A subset of 
variables of any of the formulas may be used, though when fewer variables are used, 
the optimization hierarchy becomes less reliable. 

. With respect to the variables of the above-referenced formulas, a single letter 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 
condition is that either tiie particular base is pres«it at that particular position 
(wherein the value is "1") or the base is not present (wherein the value is "0"). 
Because position 19 is optional, i.e. there might be only 18 base pairs, when there are 
only 18 base pairs, any base with a subscript of 19 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 
*, which indicates that the value of the variable is to be multiplied or weired by that 
number. 

The numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
DC (or after the variables in Formula I - VII) were determined by comparing the 
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difference in the frequency of individual bases at different positions in fimctional 
siRNA and total siRNA. Specifically, the frequency in which a given base was 
observed at a particular position in fimctional groups was compared with the 
frequency that that same base was observed in the total, randomly selected siRNA set. 
If the absolute value of the difference between the fimctional and total values was 
found to be greater than 6%, that parameter was included in the equation. Thus for 
instance, if the frequency of finding a "G" at position 13 (G13) is found to be 6% in a 
given fimctional group, and the frequency of Gn in the total population of siRNAs is 
20%, the difference between the two values is 6%-20% = -14%. As the absolute value 
is greater than six (6), this factor (-14) is included in the equation. Thus in Formula 
VIII, in cases where the siRNA under studyhas a G in position 13, the accrued value 
is (-14) * (1) = -14i. In contrast, when a base other than G is found at position 13, the 
accrued value is (-14) * (0) = 0. 

When developing a means to optimize siRNAs, the inventors observed that a 
bias toward low internal theimodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring mlRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 

With respect to the parameter GC15.19, a value of 0 - 5 will be ascribed 
depending on the number of G or C bases at positions 1 5 to 19. If there are only 18 
base pairs, the y;alue is between 0 and 4. 

With respect to the criterion GOotoi content, a number from 0-30 will be 
ascribed, which correlates to the total number of G and C nucleotides on the sense 
strand, excluding overhangs. Without wishing to be bound by any one theory, it is 
postulated that the significance of the GC content (as well as AU content at positions 
15-19, which is a parameter for formulas III - VII) relates to the easement of the 
unwinding of a double-stranded siRNA duplex. Duplex unwinding is believed to be 
crucial for siRNA fimctionaUty in vivo and overall low internal stability, especially 
low internal stability of the first unwound base pair is believed to be important to 
mairitain sufficient processivity of RISC complex-induced duplex unwinding. If the 
duplex has 19 base paus, those at positions 15-19 on the sense strand will unwind first 
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if the molecule exhibits a sufficiently low internal stability at that position. As 
persons skilled in the art are aware, RISC is a complex of approximately twelve 
proteins; Dicer is one, but not the only, helicase within this complex. Accordingly, 
although the GC parameters are believed to relate to activity with Dicer, they are also 
important for activity with other RISC proteins. 

The value of the parameter Tm is 0 when there are no internal repeats longer 
than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 
1. Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or 
more) palindrome exists within the structure, the value will be one (1). Alternatively 
if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the 
value will be zero (0)-: 

The variable "X" refers to the number of times that the same nucleotide occurs 
contiguously in a stretch of four or more units. If there are, for example, four 
contiguous As in one part of the sequence and elsewhere in the sequence four 
contiguous Cs, X =2. Further, if there are two separate contiguous stretches of four 9f 
the same nucleotides or eight or more of the same nucleotides in a row, then X=2. 
However, X does not increase for five, six or seven contiguous nucleotides. 

Again, when applying Formula VIII or Formula IX to a given mRNA, (the 
"target RNA" or "target molecule"), one may use a computer program to evaluate the 
criteria for everysse<fuence of 1 8 - 30 base pairs or only sequences of a fixed length, 
e.g., 19 base pairs. Preferably the computer program is designed such that it provides 
a report ranking of all of ttie potential siRNAs between 1 8 and 30 base pairs, ranked 
according to which sequences generate the highest value. A higher value refers to a 
more efficient siRNA for a particular target gene. The computer program that may be 
used, may be developed in any computer language that is known to be useful for 
scoring nucleotide sequences, or it may be developed with the assistance of 
commercially available product such as Microsoft's product ,net. Additionally, rather 
than run every sequence through one and/or another formula, one may compare a 
subset of the sequences, which may be desirable if for example only a subset are 
available. For instance, it may be desirable to first perform a BLAST (Basic Local 
Alignment Search Tool) search and to identify sequences that have no homology to 
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other targets. Alternatively, it may be desirable to scan the sequence and to identify 
regions of moderate GC context, then perform relevant calculations using one of the 
above-described formulas on these regions. These calculations can be done manually 
or with the aid of a computer. 

5 

As with Formulas I - VII, either Foimula VIII or Formula IX may be used for 
a given mRNA target sequence. However, it is possible that according to one or the 
other formula more than one siRNA will have the same value. Accordingly, it is 
beneficial to have a second formula by which to differentiate sequences. Formula IX 

1 0 was derived in a similar fashion as Formula VIII, yet used a larger data set and thus 
yields sequences with higher statistical correlations to highly functional duplexes. The 
sequence that has the highest value ascribed to it may be referred to as a "first 
optimized duplex." The sequence that has the second highest value ascribed to it may 
be referred to as a "second optimized duplex." Similarly, the sequences that have the 

1 5 third and fourth highest values ascribed to them may be referred to as a third 

optimized duplex and a fourth optimized duplex, respectively. When more than one 
sequence has the same value, each of them may, for example, be referred to as first 
optimized duplex sequences or co-first optimized duplexes. 

20 SiRNA sequences identified using Formula VIII are containedwithin the 

enclosed compact disks. The data included on the enclosed compact disks is 
described more fully below. The sequences identified by Formula VIII that are 
disclosed in theLCQmpae^s^disks may be used in gene silencing applications. 

25 It should be noted that for Formulas VIII and EX all of the aforementioned 

criteria are identified as positions on the sense strand when oriented in the 5' to 3' 
direction as they are identified in connection with Formulas I - VII unless otherwise 
specified. 

30 Formulas I - IX, may be used to select or to evaluate one, or more than one, 

siRNA in order to optimize silencing. Preferably, at least two optimized siR.NAs that 
have been selected according to at least one of these formulas are used to silence a 
gene, more preferably at least three and most preferably at least four. The siRNAs 
may be used individually or together in a pool or kit. Further, they may be applied to 
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a cell simultaneously or separately. Preferably, the at least two siRNAs are applied 
simultaneously. Pools are particularly beneficial for many research applications. 
However, for therapeutics, it may be more desirable to employ a single 
hyperfimctional siRNA as described elsewhere in this application. 

5 

When planning to conduct gene silencing, and it is necessary to choose 
between two or more siRNAs, one should do so by comparing the relative values 
when the siRNA are subjected to one of the formulas above. In general a higher 
scored siRNA should be used. 

10 

Useful applications include, but are not limited to, target validation, gene 
fimctionakanalysis, research and drug discovery, gene therapy and therapeutics. 
Methods for using siRNA in these applications are well known to persons of skill in 
the art 

15 

Because the ability of siRNA to function is dependent on the sequence of the 
RNA and not the species into which it is introduced, the present invention is 
applicable across a broad range of species, including but not limited to all mammalian 
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and 
20 gorillas, as well as other species and organisms such as bactma, viruses, insects, 
plants and C elegans. 



The present Jm£e£iti£aiLis also applicable for use for silencing a broad range of 
genes, including but not limited to the rougjily 45,000 genes of a hxraian genome, and 
25 has particular relevance in cases where those genes are associated with diseases such 
as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the 
aforementioned organisms. 

The siRNA selected according to the aforementioned criteria or one of the 
30 aforementioned algorithms are also, for example, useful in the simultaneous screening 
and functional analysis of multiple genes and gene families using high througjiput 
strategies, as well as in direct gene suppression or silencing. 



Development of the Algorithms 
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To identify siRNA sequence features that promote functionality and to 
quantify the importance of certain currently accepted conventional factors— such as 
G/C content and target site accessibility— the inventors synthesized an siRNA panel 
consisting of 270 siRNAs targeting three genes. Human Cyclophilin, Firefly 
Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific 
re^ons of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs 
were directed against a 199 bp segment of each respective mRNA. For DBI, 90 
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The 
sequences to which the siRNAs were directed are provided below. 

It should be noted that in certain sequences, "t" is present. This is because 
many- databases -contain information in this manner. However, the t denotes a uracil 
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, 
process a t in a sequence as a u. 

Human cvclophilin: 193— 39Q. M608S7 
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
cttccccgatgagaacttcaaactgaagcactacgggcctggctggg 

Fireflv lucif erase: 1434 — 1631. U47298 fpGL3. Prnniep f^) 
SEQ. ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg 
caagaaaaatcagagagatcctcataaaggccaagaagg 

DBI. NM 020548 f 202-3 1 0) f every oosition^ 
SEQ. ID NO. 0031: 

acgggcaaggccaagtgggatgcctggaatgagctgaaagggacttccaaggaagatgccatgaaagcttacatcaaca 
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II) 



35 



The set of duplexes was analyzed to identify correlations between siRNA 
functionality and other biophysical or thermodynamic properties. When the siRNA 
panel was analyzed in functional and non-fimctional subgroups, certain nucleotides 
were much more abundant at certain positions in functional or non-fimctional groups. 
More specifically, the fi-equency of each nucleotide at each position in highly 
functional siRNA duplexes was compared with that of nonfunctional duplexes in 
order to assess the preference for or against any given nucleotide at every position. 
These analyses were used to determine unportant criteria to be included in the siRNA 
algorithms (Formulas VIII and IX). 

The data set was also analyzed for distinguishing biophysical properties of 
. siRNAs in the functional group, such as optimal percent of GC content, propensity for 
internal structures and regional thermodynamic stability. Of the presented criteria, 
several are involved in duplex recognition, RISC activation/duplex unwinding, and 
target cleavage catalysis. 

The original data set that was the source of the statistically derived criteria is 
shown in Figure 2. Additionally, this figure shows that random selection yields 
SiRNA duplexes with unpredictable and widely varying silencing potencies as 
measured in tissue culture using HEK293 cells, to the figure, duplexes are plotted 
such that each x-axis tick-mark represents an individual siRNA, with each subsequent 
SiRNA differing in target position by two nucleotides for Human Cyclophilin and 
Firefly Ludferase, and by oneau^tide for Human DBI. Furtheimore, the y-axis 
denotes the level of target expression remaining after transfection of the duplex into 
cells and subsequent silwidng of the target. 

SiRNA identified and optimized in this document work equally well in a wide 
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the 
DBI gene in three cell lines derived fi-om different tissues. Each DBI siRNA displays 
very similar functionality in HEK293 (ATCC, CRL-1573, human embryonic kidney), 
HeLa (ATCC, CCL-2, cervical, epithelial adenocarcinoma) and DU145 (HTB-81, 
prostate) cells as deterimined by the B-DNA assay. Thus, siRNA fimctionality is 
determined by the primary sequence of the siRNA and not by the intracellular 
environment. Additionally, it should be noted that although the present invention 
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provides for a determination of the functionality of siRNA for a given target, the same 
siRNA may silence more than one gene. For example, the complementary sequence 
of the silencing siRNA may be present in more than one gene. Accordingly, in these 
circumstances, it may be desirable not to use the siRNA with highest SMARTscore™. 
In such circumstances, it may be desirable to use the siRNA with the next highest 
SMARTscore™. 

To determine the relevance of G/C content in siRNA function, the G/C content 
of each duplex in the panel was calculated and the functional classes of siRNAs 
(<F50, > F50, > F80, > F95 where F refers to the percent gene silencing) were sorted 
accordingly. The majority of the highly-functional siRNAs ^F95) fell within the 
G/C content range of 36%--52% (Figure 3B). Twice as many non-functional (< 
F50) duplexes fell within the high G/C content groups (>57% GC content) compared 
to the 36%— 52% group. The group with extremely low GC content (26% or less) 
contained a higher proportion of non-functional siRNAs and no highly-functional 
siRNAs. The G/C content range of 30%— 52% was therefore selected as Criterion I 
for siRNA functionality, consistent with the observation that a G/C range 30%— 70% 
promotes efficient RNAi targeting. Application of this criterion alone provided only a 
marginal increase in the probability of selecting functional siRNAs from the panel: 
selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The: 
siRNA panel presented here permitted a more systematic analysis and quantification 
of the importance of this criterion than that used previously. 

A relative measure of local internal stability is the A/U base pair (bp) content; 
therefore, the frequency of A/U bp was determined for each of the five terminal 
positions of the duplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel. 
Duplexes were then categorized by the number of A/U bp in positions 1 — ^5 and 15 — 
19 of the sense strand. The thermodynamic flexibility of the duplex 5'-end (positions 
1 — 5; S) did not appear to correlate appreciably with silencing potency, while that of 
the 3'-end (positions 15—19; S) correlated with efficient silencing. No duplexes 
lacking A/U bp in positions 15—19 were functional. The presence of one A/U bp in 
this region conferred some degree of functionality, but the presence of three or more 
A/Us was preferable and therefore defined as Criterion II. When applied to the test 
panel, only a marginal increase in the probability of functional siRNA selection was 
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achieved: a 1.8% and 2.3% increase for F50 and F95 duplexes, respectively (Table 
IV), 

The complementary strands of siRNAs that contain internal repeats or 
5 palindromes may form internal fold-back structures. These hairpin-like structures 
exist in equilibrium with the duplexed form effectively reducing the concentration of 
functional duplexes. The propensity to form internal hairpins and their relative 
stability can be estimated by predicted melting temperatures. High Tm reflects a 
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to 

10 form hairpins. When the functional classes of siRNAs were sorted by Tn^ (Figure 3c), 
the following trends were identified: duplexes lacking stable internal repeats were the 
most potent silencers (no F95 duplex with predicted hairpin structure T« > 60 ^C). In 
contrast, about 60% of the duplexes in the groups having internal hairpins with 
calculated values less than 20 were F80. Thus, the stability of internal repeats 

15 is inversely proportional to the silencing effect and defines Criterion III (predicted 
hairpin structure T« < 20 ""C). 

Sequence- based determinants of siRNA fimctionalitv 

When the siRNA panel was sorted into functional and non-functional groups, 

20 the frequency of a specific nucleotide at each position in a functional siRNA duplex 
was compared with that of a nonfunctional duplex in order to assess the preference for 
or against a certain nucleotide. Figure 4 shows the results of these queries and the 
subsequent resorting of the data set .(fix)m;Eigure 2). The data is separated into two 
sets: those duplexes that meet the criteria, a specific nucleotide in a certain position - 

25 grouped on the left (Selected) and those that do not - grouped on the right 

(Eluninated). The duplexes are further sorted from most functional to least functional 
with the y-axis of Figure 4a-e representing the % expression i.e. the amount of 
silencing that is elicited by the duplex (Note: each position on the X-axis represents a 
different duplex). Statistical analysis revealed correlations between silencing and 

30 several sequence-related properties of siRNAs. Figure 4 and Table IV show 

quantitative analysis for the following five sequence-related properties of siRNA: (A) 
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) 
a U at position 10 of the sense strand; (D) a base other than G at position 13 of the 
sense strand; and (E) a base other than C at position 19 of the sense strand. 
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When the siRNAs in the panel were evaluated for the presence of an A at 
position 19 of the sense strand, the percentage of non-functional duplexes decreased 
from 20% to 1 1 .8%, and the percentage of F95 duplexes increased from 2 1 .7% to 
5 29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV. 

Another sequence-related property correlated with silencing was the presence 
of an A in position 3 of the sense strand (Figure 4b). Of the siRNAs with A3, 34.4% 
were F95, compared with 21 .7% randomly selected siRNAs. The presence of a U 
10 base in position 10 of the sense strand exhibited an even greater impact (Figure 4c). 
Of the duplexes in this group, 41 .7% were F95. These properties became criteria V 
and VI, respectively. 



Two negative sequence-related criteria that were identified also appear on 
1 5 Figure 4. The absence of a G at position 1 3 of the sense strand, conferred a margmal 
increase in selecting fimctional duplexes (Figure 4d). Similarly, lack of a C at 
position 19 of the sense strand also correlated with fimctionality (Figure 4e), Thus, 
among functional duplexes, position 19 was most likely occupied by A, and rarely 
occupied by C. These rules were defined as criteria VII and VIII, respectively. 

20 

Application of each criterion individually provided marginal but statistically 
significant increases in the probability of selecting a potent siRNA. Although the 
results were informative, the inventors soughit4o«maximize potency and therefore 
consider multiple criteria or parameters. Optimization is particularly important when 
25 developing tfierapeutics. Interestingly, the probability of selecting a fimctional siRN A 
based on each tiiermodynamic criteria was 2% — 4% higher than random, but 4% — 
8% higher for the sequence-related determinates. Presumably, these sequence-related 
increases reflect the complexity of the RNAi mechanism and the multitude of protein- 
RNA interactions that are involved in RNAi-mediated silencing. 

30 
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Table IV 
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25 

The siRNA selection aleorithm 

In an effort to improve selection further, all identified criteria, including but 
not limited to those listed in Table IV were combined into the algorithms embodied in 
Formula VIII and Formula EX. Each siRNA was then assigned a score (referred to as 
30 a SMARTscore™) according to the values derived firom the formulas. Duplexes that 
scored higher than 0 or 20, for Formulas VIII and IX, respectively, effectively 
selected a set of functional siRNAs and excluded all non-functional siRNAs. 
Conversely, all duplexes scoring lower than 0 and 20 according to formulas VIII and 
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IX, respectively, contained some functional siRNAs but included all non-functional 
siRNAs. A graphical representation of this selection is shown in Figure S. 

The methods for obtaining the seven criteria embodied in Table IV are 
5 illustrative of the results of the process used to develop the infonnation for Formulas 
VIII and DC. Thus similar techniques were used to establish the other variables and 
their multipliers. As described above, basic statistical methods were use to determine 
the relative values for these multipliers. 

1 0 To determine the value for "hnprovement over Random" the difference in the 

frequency of a given attribute (e,g. GC content, base preference) at a particular 
position is determined between individual functional groups (e.g. <F50) and the total 
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for 
instance, in Criterion I (30%-52% GC content) members of the <F50 group were 

15 observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, tiie 
total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for 
this particular attribute, there is a small negative correlation between 30%-52% GC 
content and this functional group (z.e. 16.4%-20% = -3.6%). Similarly, for Criterion 
VI, (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this 

20 position 41 :7% of the time. In contrast, the total group of 270 siRNAs had a "U" at 
this position 21.7% of the time, thxis the improvement over random is calculated to be 
20%(or41.7%-21.7%). 

Identifying The Average Internal Stabilitv Profile of Strong siRNA 
25 In order to identify an internal stability profile that is characteristic of strong 

siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding 
inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 
cells and tested for their ability to induce RNAi of the respective gene. Based on their 
performance in the in vivo assay, the sequences were then subdivided into three 
30 groups, (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing. 
Sequences exhibiting 51-84% silencing were eKminated from fiirther consideration to 
reduce the difficulties in identifying relevant thermodynamic patterns. 
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Following the division of siRNA into three grovq>s, a statistical analysis was 
perfonned on each member of each group to determine the average internal stability 
profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis 
Software and other related statistical packages {e.g. Excel) were exploited to 
5 determine the internal stability of pentamers using the nearest neighbor method 

described by Freier et al, (1986) Improved free-energy parameters for predictions of 
UNA duplex stability, Proc Natl. Acad. Sci. U. S. A. 83(24): 9373-7. Values for each 
group at each position were then averaged, and the resulting data were graphed on a 
linear coordinate system with the Y-axis expressing the AG (free energy) values in 
10 kcal/mole and the X-axis identifying the position of the base relative to the 5' end. 

The results of the analysis identified iiaultiple key regions in siRNA molecules 
that were critical for successful gene silaicing. At the 3'-most end of the sense strand 
(5'antisense), highly functional siRNA (>95% gene silencing, see Figure 6a, >F95) 

15 have a low internal stability (AISP of position 19 = ~ -7.6kcal/mol). In contrast low- 
efficiency siRNA (i.e. those exhibiting less than 50% silencing, <F50) display a 
distinctly different profile, having high AG values (~ -8.4kcaymol) for the same 
position. Moving in a 5' (sense strand) direction, the internal stability of highly 
efficient siRNA rises (position 12 = ~ -8.3kcal/mole) and then drops again (position 7 

20 = ~ -7.7kcal/niol) before leveling off at a value of approximately -8.1kcal/mol for the 
5' terminus. SiRNA with poor silencing capabilities show a distinctly different 
profile. While the AISP value at position 12 is nearly identical with that of strong 
siRNAs, the values at positions 7 and 8 rise considerabl^> peaking at a high of - -9.0 
kcal/mol. In addition, at the 5' end of the molecule the AISP profile of strong and 

25 weak siRNA differ dramatically. Unlike the relatively strong values exhibited by 

siRNA in the >95% silaicing group, siRNAs that exhibit poor silencing activity have 
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively). 

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal 
30 shapes that are roughly 1 80** out-of-phase with each otho-. While these 

thermodynamic descriptions define the archetypal profile of a sti-ong siRNA, it will 
likely be the case that neither the AG values given for key positions in the profile or 
the absolute position of the profile along the Y-axis {i.e. the AG -axis) are absolutes. 
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Profiles that are shifted upward or downward (i.e having on an average, higher or 
lower values at every position) but retain the relative shape and position of the profile 
along the X-axis can be foreseen as being equally effective as the model profile 
described here. Moreover, it is likely that siRNA that have strong or even stronger 
5 gene-specific silencing effects might have exaggerated AG values (either higher or 
lower) at key positions. Thus, for instance, it is possible that the 5'-most position of 
the sense strand (position 19) could have AG values of 7.4 kcal/mol or lower and still 
be a strong siRNA if, for instance, a G-C ^ G-TAJ mismatch were substituted at 
position 19 and altered duplex stability. Similarly, position 12 and position 7 could 

1 0 have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without 

abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, 
a stabilizing chemical modification (e.g. a chemical modification of the T position of 
the sugar backbone) could be added that increases the average intemal stability at that 
position. Similarly, at position 7, mismatches similar to those described previously 

1 5 could be introduced that would lower the AG values at that position. 

Lastly, it is important to note that while fimctional and non-fimctional siRNA 
were originally defined as those molecules having specific silencing properties, both 
broader or more limiting parameters can be used to define these molecules. As used 

20 herein, unless otherwise specified, "non-fimctional siRNA" are defined as those 

siRNA that induce less than 50% (<50%) target silencing, "semi-fimctional siRNA" 
induce 50-79% target silencing, "fiinctional siRNA" are molecules that induce 80- 
95% gene silencing, and "highly-fimctional siRNA" are irfdlgcules that induce great 
than 95% gene silencing. These definitions are not intended to be rigid and can vary 

25 depending upon the design and needs of the application. For instance, it is possible 
that a researcher attempting to map a gene to a chromosome using a fimctional assay, 
may identify an siRNA that reduces gene activity by only 30%. While this level of 
gene silencing may be "non-fimctional" for e,g. therapeutic needs, it is sufficient for 
gene mapping purposes and is, under these uses and conditions, "functional." For 

30 these reasons, fimctional siRNA can be defined as those molecules having greater 
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 
lOOnM transfection conditions. Similarly, depending upon the needs of the study 
and/or application, non-fimctional and semi-fiinctional siRNA can be defined as 
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having different parameters. For instance, semi-functional siRNA can be defined as 
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 
lOOnM transfection conditions. Similarly, non-functional siRNA can be defined as 
being those molecules that silence gene expression by less than 70%, 60%, 50%, 
5 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in 
the "Definitions" section of this text should be applied. 

Functional attributes can be assigned to each of the key positions in the AISP 
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs maybe 
1 0 necessary for determining which end of the molecule enters the RISC complex. In 

contrast, the high and low AISP values observed in the central regions of the molecule 
maybe critical for siRNA-target mRNA interactions iand product release, 
respectively. 

15 If the AISP values described above accurately define the theraiodynamic 

parameters of strong siRNA, it would be expected that similar patterns would be 
observed in strong siRNA isolated from nature. Natural siRNAs exist in a harsh, 
RNase-rich environment and it can be hypothesized that only those siRNA that 
exhibit heightened afiSnity for RISC (i.e. siRNA that exhibit an average internal 

20 stability profile similar to those observed in strong siRNA) would survive in an 
intracellular environment. This hypothesis was tested using GFP-specific siRNA 
isolated from N. benthamiana. Llave et al. (2002) Endogenous and Silencing- 
Associated Small RNAs in Plants, The Plant Cell 14, 1605-l&l5ifintroduced long 
double-stranded GFP-encoding RNA into plants and subsequently re-isolated GFP- 

25 specific siRNA from the tissues. The AISP of fifty-nine of these GFP-siRNA were 
determined, averaged, and subsequently plotted alongside the AISP profile obtained 
from Ae cyclophilin B/DBI/ luciferase siRNA having >90% silencing properties 
(Figure 6b). Comparison of the two groups show that profiles are nearly identical. 
This finding validates the information provided by the internal stability profiles and 

30 demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/ 

luciferase siRNAs are not gene specific; and (2) AISP values can be used to search for 
strong siRNAs in a variety of species. 
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Both chemical modifications and base-pair mismatches can be incorporated 
into siRNA to alter the duplex's AISP and functionality. For instance, introduction of 
mismatches at positions 1 or 2 of the sense strand destabilized the 5'end of the sense 
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly, 
5 addition of 2'-0-methyl groups to positions 1 and 2 of the sense strand can also alter 
the AlSP and (as a result) increase both the functionality of the molecule and 
eliminate off-target effects that results fi-om sense strand homology with the unrelated 
targets (Figures 8a, 8b). 

10 Rationale for Criteria in a Biological Context 

The fate of siRNA in the RNAi pathway may be described in 5 major steps: 
; : (1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex 
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) 
mRNA cleavage, and (5) product release (Figure 1). Given the level of nucleic acid- 

1 5 protein interactions at each step, siRNA fiinctionality is likely influenced by specific 
biophysical and molecular properties that promote efficient interactions within the 
context of the multi-component complexes. Indeed, the systematic analysis of the 
siRNA test set identified multiple factors that correlate well with functionality. When 
combined into a single algorithm, they proved to be very effective in selecting active 

20 siRNAs. 

The factors described here may also be predictive of key functional 
associations important for each step in RNAi. For example, theppteaatial formation of 
intemal hairpin structures correlated negatively with siRNA functionality, 

25 Complementary strands with stable intemal repeats are more likely to exist as stable 
hairpins thus decreasing the effective concentration of the functional duplex form. 
This suggests that the duplex is the preferred conformation for initial pre-RISC 
association. Indeed, although single complementary strands can induce gene 
silencing, the effective concentration required is at least two orders of magnitude 

30 higher than that of the duplex form. 

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex 
unAvinding step and "activation" of the RISC. The siRNA fiinctionality was shown to 
correlate with overall low intemal stability of the duplex and low intemal stability of 
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the 3' sense end (or differential internal stability of the 3' sense compare to the 5' 
sense strand), which may reflect strand selection and entry into the RISC. Overall 
duplex stability and low intemal stability at the 3' end of the sense strand were also 
correlated with siRNA functionality. Interestingly, siRNAs with very high and very 
5 low overall stabiUty profiles correlate strongly with non-functional duplexes. One 
interpretation is that high intemal stability prevents efficient unwinding while very 
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the 
RISC. 

10 Several criteria desoribe base preferences at specific positions of the sense 

strand and are even more intriguing when considering their potential mechanistic 
roles m target recognition and mRNA cleavage. Base preferences for A at position 19 
of the sense strand but not C, are particularly interesting because they reflect the same 
base preferences observed for naturally occurring miRNA precursors. That is, among 

15 the reported miRNA precursor sequences 75% contain a U at position 1 which 

corresponds to an A in position 19 of the sense strand of siRNAs, while G was under- 
represented in this same position for miRNA precursors. These observations support 
the hypothesis that both miRNA precursors and siRNA duplexes are processed by 
very similar if not identical protein machinery. The functional interpretation of the 

20 predominance of a U/A basapair is that it promotes flexibility at the 5'antisense ends 
of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding 
iand selective strand entrance into an activated RISC. 

Among the criteria associated with base preferences that are likely to 
25 influfflce mRNA cleavage or possibly product release, the preference for U at 
position 10 of the sense strand exhibited the greatest impact, enhancing the 
probability of selecting an F80 sequence by 1 3.3%. Activated RISC preferentially 
cleaves target mRNA between nucleotides 10 and 11 relative to the 5' end of the 
complementary targeting strand. Therefore, it may be that U, the preferred base for 
30 most endoribonucle^ses, at this position supports more efficient cleavage. 

Alternatively, a U/A bp between the targeting siRNA strand and its cognate target 
mRNA may create an optimal conformation for the RISC-associated "slicing" 
activity. 
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Pooling; 

According to another embodiment, the present invention provides a pool of at 
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one 
strand of each of the siRNAs, the saise strand comprises a sequence that is 
5 substantially similar to a sequence within a target mRNA. The opposite strand, the 
antis«ise strand, will preferably comprise a sequence that is substantially 
complementary to that of the target mRNA. More preferably, one strand of each 
siRNA will comprise a sequence that is identical to a sequence that is contained in the 
target mRNA. Most preferably, each siRNA will be 19 base pairs in length, and one 
10 strand of each of the siRNAs will be 100% complementary to a portion of the target 
mRNA. 

By increasing the numbear of siRNAs directed to a particular target using a 
pool or kit, one is able both to increase the likelihood that at least one siRNA with 

15 satisfactory functionality will be included, as well as to benefit from additive or 

synergistic effects. Further, when two or more siRNAs directed against a single gene 
do not have satisfactory levels of functionality alone, if combined, they may 
satisfactorily promote degradation of the target messenger RNA and successfiiUy 
inhibit translation. By including multiple siRNAs in the system, not only is the 

20 probability of silencing increased, but the economics of operation are also improved 
when compared to adding different siRNAs sequentially. This effect is contrary to the 
conventional wisdom that the concurrent use of multiple siRNA will negatively 
impact gene silencing (e.g. Holen, T. etal. (2003) "Similar behavior of^iDgle;strand 
and double shand siRNAs suggests they act through a common RNAi pathway." 

25 NAR 31: 2401-21407). 

hi fact, when two siRNAs were pooled together, 54% of the pools of two 
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the 
percentage of functionality was achieved by randomly combining two siRNAs. 
30 FurthCT, over 84% of pools containingtwo siRNAs induced more than 80% gene 
silencing. 

More preferably, the kit is comprised of at least three siRNAs, wherein one 
strand of each siRNA comprises a sequaice that is substantially similar to a sequence 



of the target mRNA and the other strand comprises a sequence that is substantially 
complementary to the region of the target mRNA. As with the kit that comprises at 
least two siRNAs, more preferably one strand will comprise a sequence that is 
identical to a sequaice tiiat is contained in the mRNA and another strand that is 1 00% 
5 complementary to a sequence that is contained in the mRNA. During experiments, 
when three siRNAs were combined together, 60% of the pools induced more than 
95% gene silencing and 92% of the pools induced more than 80% gene silencing. 

Further, even more preferably, tiie kit is comprised of at least four siRNAs, 
1 0 wherein one strand of each siRNA comprises a sequence that is substantially similar 
to a region of the sequence of the target mRNA, and the other strand comprises a 
sequence that is substantially compl«nentary to the region of the target mRNA. As 
with the kit or pool that comprises at least two siRNAs, more preferably one strand of 
each of the siRNA duplexes will comprise a sequence that is identical to a sequence 
1 5 that is contained in the mRNAj. and another strand that is 1 00% complementary to a 
sequence that is contained in the mRNA. 

Additionally, kits and pools with at least five, at least six, and at least seven 
siRNAs may also be useful with the present invention. For example, pools of five 

20 siRNA induced 95% gene silencingjvith 77% probability and 80% silencing with 
98.8% probabiUty. Thus, pooling of siRNAs together can result in the creation of a 
target-specific silencing reagent with ahnost a 99% probability of being fimctional. 
The fact that sudi high levels of success are achievable using such pools of siRNAi^^. 
enables one to dispense with costly and time-consuming target-specific validation 

25 procedures. 

For this embodiment, as well as the other aforementioned embodiments, each 
of the siRNAs within a pool will preferably comprise between 1 8 and 30 base pairs, 
more preferably between 1 8 and 25 base pairs, and most preferably 19 base pairs. 
30 Within each siRNA, preferably at least 18 contiguous bases of the antisense strand 
will be 100% complementary to the target mRNA. More preferably, at least 19 
contiguous bases of the antisense strand will be 100% complementary to die target 
mRNA. Additionally, fliere may be overhangs on either the sense strand or the 
antisense strand, and these overhangs may be at either the 5' end or the 3' end of 
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either of the strands, for example there may be one or more overhangs of 1-6 bases. 
When overhangs are present, they are not included in the calculation of the number of 
base pairs. The two nucleotide 3' overhangs mimic natural siRNAs and are 
commonly used but are not essential. Preferably, the overhangs should consist of two 
5 nucleotides, most often dTdT or UU at the 3' end of the sense and antisense strand 
that are not complementary to the target sequence. The siRNAs may be produced by 
any method that is now known or that comes to be known for synthesizing double 
stranded RNA that one skilled in the art would appreciate would be useful in the 
present invention. Preferably, the siRNAs will be produced by Dharmacon's 
1 0 proprietary ACE® technology. However, other methods for synthesizing siRNAs are 
well known to persons skilled in the art and include, but are not limited to, any 
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in 
vitro transcription of RNA oligonucleotides, flie use of vectors for expression within 
cells, recombinant Dicer products and PGR products. 

15 

The siRNA duplexes within the aforementioned pools of siRNAs may 
correspond to overlapping sequences within a particular mRNA, or non-overlapping 
sequences of the mRNA. However, preferably they correspond to non-overlapping 
sequences. Further, each siRNA may be selected randomly, or one or more of the 
20 siRNA may be selected according to the;mteria discussed above for maximizing the 
effectiveness of siRNA. 



* - Included in the definition of siRNAs are siRNAs that contain substituted . v . : . . 

and^or labeled nucleotides that may, for example, be labeled by radioactivity, 

25 fluorescence or mass. The most common substitutions are at the 2' position of the 
ribose sugar, where moieties such as H (hydrogen) F, NH3, OCH3 and other O- alkyl, 
alkenyl, alkynyl, and orthoesters, may be substituted, or in the phosphorous backbone, 
where sulfur, amines or hydrocarbons may be substituted for the bridging of non- 
bridging atomsjn the phosphodiester bond. Examples of modified siRNAs are 

30 explained more fully in commonly assigned U.S. Patent Application Sen No. 
10/613,077, filed July 1, 2003, which is incorporated by reference herein. 

Additionally, as noted above, the cell type into which the siRNA is introduced 
may affect the ability of the siRNA to enter die cell; however, it does not appear to 
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affect the ability of the siRNA to function once it enters the cell. Methods for 
introducing double-stranded RNA into various cell types are well known to pe 
skilled in the art. 



As persons skilled in the art are aware, in certain species, the presence of 
proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically 
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in 
C. elegans and other non-mammalian organisms. In fact, in organisms that contain ^ 
these proteins, the siRNA may be inherited. Two other proteins that are well studied 
and known to be a part of the machinery are members of the Argonaute family and 
Dicer, as well as their homologues. There is also initial evidence that the RISC 
complex might be associated with the ribosome so the more efficiently translated 
mRNAs will be more susceptible to silencing than othra-s. 

Another very important factor in the efficacy of siRNA is mRNA localization. 
In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any 
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs 
modified with intemucleotide linkages, may be able to cause silencing by acting in the 
nucleus. Examples of these types of modifications are descaibed in commonly 
assigned U.S. Patent Application Serial Nos.:a0/43 1,027 and 10/613,077, each of 
which is incorporated by reference herein. 

As described above, even when one selects at least two siRNAs at landom^the- 
effectiveness of the two may be greater than one would predict based on the 
effectiveness of two individual siRNAs. This additive or synergistic effect is 
particularly noticeable as one increases to at least three siRNAs, and even more 
noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of the non- 
functional and semi-functional siRNAs, particularly more than five siRNAs, can lead 
to a silencmg mixture that is as effective if not more effective than any one particular 
functional siRNA. 
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Within the kit of the presait invention, preferably eadi siRNA will be present 
in a concentration of between 0.001 and 200 ^M, more preferably between 0.01 and 
200 nM, and most preferably between 0.1 and 10 nM. 

In addition to preferably comprising at least four or five siRNAs, the kit of the 
present invention will also preferably comprise a buffer to keep the siRNA duplex 
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA 
stable. For example, the buffer may be comprised of 100 mM KCl, 30 mM HEPES- 
pH 7.5, and 1 mM MgClz. Alternatively, kits might contain complementary strands 
that contain any one of a number of diemical modifications (e.g. a 2'-0-ACE) that 
protect the agents fiom degradation by nucleases. In this instance, the user may (or 
may not) remove the modifying protective group ie.g. deprotect) before annealing the 
two complementary strands together. 

By way of example, the kit may be organized such that pools of siRNA 
di^lexes are provided on an airay or microarray of wells or drops for a particular 
gene set or for unrelated genes. TTie array may, for example, be in 96 wells, 384 wells 
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now 
known or that come to be known to persons skilled in the art. Within an airay, 
preferably there will be controls such as fimctiottSd anti-lamin A/C, cyclophili^ and 
two SiRNA duplexes that are not specific to the gene of interest. 

In order to ensure stability of the SiRNA pools prior to usage, they may be 
retained in lyophilized form at minus twenty degrees (-20°C) until they are ready for 
use. Prior to usage, they should be resuspended; however, even once resuspended, for 
example, in the aforementioned buffer, they should be kept at minus twenty degrees, 
(-20°C) until used. The aforementioned buffer, prior to use, may be stored at 
approximately 4-C or room temperature. Effective temperatures at which to conduct 
transfections are well known to persons skilled in the art and include for example, • 
room temperature. 

The kit may be applied either in vivo or in vitro. Preferably, the siRNA of the 
pools or kits is applied to a ceU through transfection, employing standard transfection 
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protocols. These methods are well known to persons skilled in the art and include the 
use of lipid-based carriers, electroporation, cationic carriers, and microinjection. 
Further, one could apply the present invention by synthesizing equivalent DNA 
sequences (either as two separate, complementary strands, or as haiipin molecules) 
instead of siRNA sequences and introducing them into cells through vectors. Once in 
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate 
the SiRNA. Examples of vectors suitable for use with the present application include 
but are not limited to the standard transient expression vectors, adenoviruses, 
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. 
Any vector that has an adequate siRNA expression and procession module may be 
used. Furthermore, certain chemical modifications to siRNAs, including but not 
limited to conjugations to other molecules, may be used to facilitate delivery. For 
certain applications it may be preferable to deliver molecules witiiout h^sfection by 
simply formulating in a physiological acceptable solution. 

This embodiment may be used in connection with any of the aforementioned 
embodiments. Accordingly, flie sequences within any pool may be selected by 
rational design. 



Multigene SUencing 

In addition to developing kits that contain multiple siRNA directed against a 
: . single gene, another embodiment includes the use of multiple siRNA targeting 
multiple genes. Multiple genes may be targeted through the use of high- or hyper- 
functional SiRNA. High- or hyper- functional siRNA that exhibit increased potency, 
require lower concenti-ations to induce desired phenotypic (and thus therapeutic) 
effects. This circumvents RISC saturation. It therefore reasons tiiat if lower 
concentrations of a single siRNA are needed for knockout or knockdown expression 
of one gene, then the remaining (uncomplexed) RISC will be fi-ee and available to 
interact with siRNA directed against two, three, four, or more, genes. Thus in this 
embodiment, the autiiors describe the use of highly functional or hyper-functional 
SiRNA to knock out three separate genes. More preferably, such reagents could be 
combined to knockout four distinct genes. Even more preferably, highly functional 
hyperfunctional siRNA could be used to knock out five distinct genes. Most 



or 
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preferably, siRNA of this type could be used to knockout or knockdown the 
expression of six or more genes. 

Hvperfmictional siRNA 

5 The term hyperftinctional siRNA (hf-siRNA) describes a subset of the siRNA 

population that induces RNAi in cells at low- or sub-nanomolar concentrations for 
extended periods of time. These traits, heightened potency and extended longevity of 
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents 
having higher potency require lesser amounts of the molecule to achieve the desked 
1 0 physiological response, thus reducing the probability of side effects due to "off- 
target*' interference. In addition to the potential therapeutic benefits associated with 
hyperfunctional siRNA, hf-siRNA are also desirable from^economic position. 
Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing the 
overall expenditures to both the manufacturer and the consumer. 

1 5 Identification of hyperfunctional siRNA involves multiple steps that are 

designed to examine an individual siRNA agent's concentration- and/or longevity- 
profiles. In one non-limiting example, a population of siRNA directed against a single 
gene are first analyzed using the previously described algorithm (Formula VIII). 
Individual siRNA are then introduced into a test cell line and assessed for the ability 

20 to degrade the target mRNA. It is important to note that when performing this step it 
is not necessary to test all of the siRNA. Instead, it is sufficient to test only those 
siRNA having the highest SMARTscoresT'^* (Le, SMARTscore™ > -10). 
Subsequently, the gene silencing data is plotted against the SMARTscores'^^ (see 
Figure 9). SiRNA that (1) induce a high degree of gene silencing {Le. they induce 

25 greater than 80% gene knockdown) and (2) have superior SMARTscores™ (Le. a 
SMARTscore™ of > -10, suggesting a desirable average internal stability profile) are 
selected for fiirther investigations designed to better xmderstand the molecule's 
potency and longevity. In one, non-limiting study dedicated to understanding a 
molecule's potency, an siRNA is introduced into one (or more) cell types in 

30 increasingly diminishing concentrations (e.g. 3.0 0.3 nM). Subsequently, the level 
of gene silencing induced by each concentration is examined and siRNA that exhibit 
hyperfunctional potency (Le, those that induce 80% silencing or greater at e,g. 
picomolar concentrations) are identified. In a second study, the longevity profiles of 
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siRNA having high (>-10) SMARTscores™ and greater than 80% silencing are 
examined. In one non-limiting example of how this is achieved, siRNA are introduced 
into a test cell line and the levels of RNAi are measured over an extended period of 
time ie,g, 24-168 hrs). SiRNAs that exhibit strong RNA interference patterns (i.e. 
5 >80 % interference) for periods of time greater than, e.g., 120 hours are thus 

identified. Studies similar to those described above can be performed on any and all 
of the >10^ siRNA included in this document to further define the most functional 
molecule for any given gene. Molecules possessing one or both properties (extended 
longevity aad heightened potency) are labeled "hyperfunctional siRNA," and 
10 earmarked as candidates for future therapeutic studies. 

While the example(s) given above describe one^^^ans by which 
hyperfunctional siRNA can be isolated, neither the assays themselves nor the 
selection parameters used are rigid and can vary with each family of siRNA. Families 
15 of siRNA include siRNAs directed against a single gene, or directed against a related 
family of genes* 

The highest quality siRNA achievable for any given gene may vary 
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies 

20 such as those described above may enable the identification of an siRNA that, at 
picomolar concentrations, induces 99^% silencing for a period of 10 days. Yet 
identical studies of a second gene (gene Y) may yield an siRNA that at high 
nanomolar concentrations (e.g. lOOnM) induces only 75% silencing for a period of 2 
days. Both molecules represent the very optimum siRNA for their respective gene 

25 targets and therefore are designated "hyperfunctional/' Yet due to a variety of factors 
including but not linwted to target concentration, siRNA stability, cell type, off-target 
interferaice, and others, equivalent levels of potency and longevity are not 
achievable. Thus, for these reasons, the parameters described in the before mentioned 
assays, can vary. While the initial screen selected siRNA that had SMARTscores*^^ 

30 above -1 0 and a gene silencing capability of greater than 80%, selections that have 
stronger (or weaka:) parameters can be implemented. Similarly, m the subsequent 
studies designed to identify molecules with high potency and longevity, the desired 
cutoff criteria (i.e. the lowest concentration that induces a desirable level of 
interference, or the longest period of time that interference can be observed) can vary. 
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The experimentation subsequent to application of the rational criteria of this 
application is significantly reduced where one is trying to obtain a suitable 
hyperfijnctional siRNA for, for example, therapeutic use. When, for example, the 
additional experimentation of the type described herein is applied by one skilled in flie 
art with this disclosure in hand, a hyperfonctional siRNA is readily identified. 

The siRNA may be introduced into a cell by any method that is now known or 
that comes to be known and that from reading this disclosure, persons skilled in the 
art would determine would be useful in connection with the present invention in 
enabling siRNA to cross the cellular membrane. These methods include, but are not 
limited to, any manner of transfection, such as for example transfection employing 
DEAE-Dextran, calcium phosphate, cationic lipidMq)osomes, micelles, manipulation 
of pressure, microinjection, electroporation, inmiunoporation, use of vectors such as 
viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the 
polynucleotides to specific conjugates or ligands such as antibodies, antigens, or 
receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake, 
and the like. 



Having described the invention with a degree of particularity, examples will 
now be provided. These examples are not intended to and should not be construed to 
limit the scope of the claims in any way. 

i i., Examples 
General Techniques and Nomenclatures 

SiRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first 
nucleotide of the 5'-end of the sense strand is position 1 , which corresponds to 
position 19 of the antisense strand for a 19-mer. In most cases, to compare results 
from different experiments, silencing was determined by measuring specific transcript 
mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours 
post-transfection, with siRNA concentrations held constant at 100 nM. For all 
experiments, unelss otherwise specified transfection efficiency was ensured to be over 
95%, and no detectable cellular toxicity was observed. The following system of 
nomenclature was used to compare and report siRNA-silencing functionality: "F" 
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followed by the degree of minimal knockdown. For example, F50 signifies at least 
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 
siRNAs were considered non-functional. 

5 Cell culture and transfection. 96-well plates are coated with 50 \il of 50 mg/ml poly- 
L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried 
for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are 
released fi-om their solid support by trypsinization, diluted to 3.5 X 10^ cells/ml, 
followed by the addition of 100 jiL of cells/well. Plates are then incubated overnight 

10 at 37° C, 5% CO2. Transfection procedures can vary widely depending on the cell 
type and transfection reagents. In one non-limiting example, a transfection mixture 
consisting of 2 mL Opti-MEM I (Gibco-BKE^O' jittipofectamine 2000 
(Invitrogen), 15 SUPERNasin at 20 U/fil (Ambion), and 1.5 ^il of reporter gene 
plasmid at 1 ^g/ [il is prepared in 5-ml polystyrene round bottom tubes. 100 \il of 

15 transfection reagent is dien combined with 100 \i\ of siRNAs in polystyrene deep-well 
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 ^il of Opti- 
MEM is then added to each well to bring the final siRNA concentration to 100 nM. 
Plates are then.sealed with parafihn and mixed. Media is removed fi-om HEK293 
cells and replaced with 95 ^1 of transfection mixture. Cells are incubated pvaiiight at 

20 37»C,5%C02. 

Qu antification of gene knockdown. A variety of quantification procedures can be 
used to measure the level of silencing induced by siRNA or siRNA pools. In one non- 
limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene 
25 branched-DNA (bDNA) kits (Bayer) (Wang, et al. Regulation of insulin preRNA 
splicing by glucose. Proc Natl Acad Sci 1 997, 94:4360.) are used according to 
manufacturer instructions. To measure luciferase activity, media is removed fi-om 
HEK293 cells 24 hrs post-transfection, and 50 ^1 of Steady-GLO reagent (Promega) 
is added. Afl;er 5 min, plates are analyzed on a plate reader. 

30 

Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNAs panels (Figure 5a, b) sorted 
according to using Formula VIII predicted values. All siRNAs scoring more than 0 
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(formula VIII) and more then 20 (formula IX) are ftdly functional, 
sequences for eadi gene (and DBI) appear below in Table III. 



All ninety 



TABLE III 



Cyclo 


1 


SEQ. ID 0032 


Cyclo 


2 


SEQ.ID0033 


Cyclo 


3 


SEQ. ID 0034 


Cyclo 


4 


SEQ. ID 0035 


Cyclo 


5 


SEQ. ID 0036 


Cyclo 


6 


SEQ. ID 0037 


Cyclo 


7 


SEQ. ID 0038 


Cyclo 


- 8 


SEQ. ID 0039 


Cyclo 


9 


SEQ. ID 0040 


Cyclo 


10 


SEQ. ID 0041 


Cyclo 


11 


SEQ. ID 0042 


Cyclo 


12 


SEQ. ID 0043 


Cyclo 


13 


SEQ. ID 0044 


Cyclo 


14 


SEQ. ID 0045 


Cyclo 


15 


SEQ. ID 0046 


Cyclo 


16 


SEQ. ID 0047 


Cyclo 


17 


SEQ. ID 0048 


Cyclo 


18 


SEQ. ID 0049 


Cyclo 


19 


SEQ. ID 0050 


Cyclo 


20 


SEQ. ID 0051 


Cyclo 


21 


SEQ. ID 0052 


Cyclo 


22 


SEQ. ID 0053 


Cyclo 


23 


SEQ. ID 0054 


Cyclo 


24 


SEQ. ID 0055 


Cyclo 


25 


SEQ. ID 0056 


Cyclo 


26 


SEQ. ID 0057 


Cyclo 


27 


SEQ. ID 0058 


Cyclo 


28 


SEQ. ID 0059 


Cyclo 


29 


SEQ. ID 0060 


Cyclo 


30 


SEQ. ID 0061 


Cyclo 


31 


SEQ. ID 0062 


Cyclo 


32 


SEQ. ID 0063 


Cyclo 


33 


SEQ. ID 0064 


Cyclo 


34 


SEQ. ID 0065 



GUUCCAAAAACAGUGGAUA 
UCCAAAAACAGUGGAUAAU 
CAAAAACAGUGGAUAAUUU 
AAAACAGUGGAUAAUUUUG 
AACAGUGGAUAAUUUUGUG 
CAGUGGAUAAUUUUGUGGC 
GUGGAUAAUUUUGUGGCCU 

GGAUAAUUUUGUGGCCUUA 

AUAAUUUUGUGGCCUUAGC 
AAUUUUGUGGCCUUAGCUA 
UUUUGUGGCCUUAGCUACA 
UUGUGGCCUUAGCUACAGG 
GUGGCCUUAGCUACAGGAG 
GGCCUUAGCUACAGGAGAG 
CCUUAGCUACAGGAGAGAA 
UUAGCUACAGGAGAGAAAG 
AGCUACAGGAGAGAAAGGA 
CUACAGGAGAGAAAGGAUU 
ACAGGAGAGAAAGGAUUUG 
AGGAGAGAAAGGAUUUGGC 
GAGAGAAAGGAUUUGGCUA 
GAGAAAGGAUUUGGCUACA 
GAAAGGAUUUGGCUACAAA 
AAGGAUUUGGCUACAAAAA 
GGAUUUGGCUACAAAAACA 
AUUUGGCUACAAAAACAGC 
UUGGCUACAAAAACAGCAA 
GGCUACAAAAACAGCAAAU 
CUACAAAAACAGCAAAUUC 
ACAAAAACAGCAAAUUCCA 
AAAAACAGCAAAUUCCAUC 
AAACAGCAAAUUCCAUCGU 
ACAGCAAAUUCCAUCGUGU 
AGCAAAUUCCAUCGUGUAA 



Cyclo 


35 


SEQ. ID 0066 


Cyclo 


36 


SEQ. ID 0067 


Cyclo 


37 


SEQ. ID 0068 


Cyclo 


38 


SEQ. ID 0069 


Cyclo 


39 


SEQ. ID 0070 


Cyclo 


40 


SEQ. ID 0071 


Cyclo 


41 


SEQ. ID 0072 


Cyclo 


42. 


SEQ. ID 0073 


Cyclo 


43 


SEQ. ID 0074 


Cyclo 


44 


SEQ. ID 0075 


Cyclo 


45 


SEQ. ID 0076 


Cyclo 


46 


SEQ. ID 0077 


Cyclo 


47 


SEQ. ID 0078 


Cyclo 


48 


SEQ. ID 0079 


Cyclo 


49 


SEQ. ID 0080 


Cyclo 


50 


SEQ. ID 0081 


Cyclo 


51 


SEQ. ID 0082 


Cyclo 


52 


SEQ. ID 0083 


Cyclo 


53 


SEQ. ID 0084 


Cyclo 


54 


SEQ. ID 0085 


Cyclo 


55 


SEQ. ID 0086 


Cyclo 


' 56 


SEQ. ID 0087 


Cyclo 


57 


SEQ. ID 0088 


Cyclo 


58 


SEQ. ID 0089 


Cyclo 


59 


SEQ. ID 0090 


Cyclo 


60 


SEQ. ID 0091 


Cyclo 


61 


SEQ. ID 0092 


Cyclo 


62 


SEQ. ID 0093 


Cyclo 


63 


SEQ, ID 0094 


Cyclo 


64 


SEQ. ID 0095 


Cyclo 


65 


SEQ. ID 0094 


Cyclo 


66 


SEQ. ID 0096 


Cyclo 


67 


SEQ. ID 0097 


Cyclo 


68 


SEQ. ED 0098 


Cyclo 


69 


SEQ. ID 0099 


Cyclo 


70 


SEQ. ID 0100 


Cyclo 


71 


SEQ. ID 0101 


Cyclo 


72 


SEQ. ID 0102 


Cyclo 


73 


SEQ. ID 0103 



CAAAUUCCAUCGUGUAAUC 
AAUUCCAUCGUGUAAUCAA 
UUCCAUCGUGUAAUCAAGG 
CCAUCGUGUAAUCAAGGAC 
AUCGUGUAAUCAAGGACUU 
CGUGUAAUCAAGGACUUCA 
UGUAAUCAAGGACUUCAUG 
UAAUCAAGGACUUCAUGAU 
AUCAAGGACUUCAUGAUCC 
CAAGGACUUCAUGAUCCAG 
AGGACUUCAUGAUCCAGGG 
GACUUCAUGAUCCAGGGCG 
CUUCAUGAUCCAGGGCGGA 
:.U^... ... UCAUGAUCCAGGGCGGAGA 

AUGAUCCAGGGCGGAGACU 
GAUCCAGGGCGGAGACUUC 
UCCAGGGCGGAGACUUCAC 
CAGGGCGGAGACUUCACCA 
GGGCGGAGACUUCACCAGG 

GCGGAGACUUCACCAGGGG 
GGAGACUUCACCAGGGGAG 
AGACUUCACCAGGGGAGAU 
ACUUCACCAGGGGAGAUGG 
UUCACCAGGGGAGAUGGCA 
CACCAGGGGAGAUGGCACA 
CCAGGGGAGAUGGCACAGG 
AGGGGAGAUGGCACAGGAG 
GGGAGAUGGCACAGGAGGA 
GAGAUGGCACAGGAGGAAA 
GAUGGCACAGGAGGAAAGA 
UGGCACAGGAGGAAAGAGC 
GCACAGGAGGAAAGAGCAU 
ACAGGAGGAAAGAGCAUCU 
AGGAGGAAAGAGCAUCUAC 
GAGGAAAGAGCAUCUACGG 
GGAAAGAGCAUCUACGGUG 
AAAGAGCAUCUACGGUGAG 
AGAGCAUCUACGGUGAGCG 
AGCAUCUACGGUGAGCGCU 
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Cyclo 


74 


SEQ.ID0104 


CAUCUACGGUGAGCGCUUC 


Cycle 


75 


SEQ.ID0105 


UCUACGGUGAGCGCUUCCC 


Cycld 


76 


SEQ. ID 0106 


UACGGUGAGCGCUUCCCCG 


Cyclo 


77 


SEQ. ID 0107 


CGGUGAGCGCUUCCCCGAU 


Cyclo 


78 


SEQ. ID 0108 


GUGAGCGCUUCCCCGAUGA 


Cyclo 


79 


SEQ. ID 0109 


GAGCGCUUCCCCGAUGAGA 


Cyclo 


80 


SEQ. ED 01 10 


GCGCUUCCCCGAUGAGAAC 


Cyclo 


81 


SEQ.IDOlll 


GCUUCCCCGAUGAGAACUU 


Cyclo 


82 


SEQ. ID 01 12 


UUCCCCGAUGAGAACUUCA 


Cyclo 


83 


SEQ. ID 0113 


CCCCGAUGAGAACUUCAAA 


Cyclo 


84 


SEQ. ED 01 14 


CCGAUGAGAACUUCAAACU 


Cyclo 


85 


SEQ. ID 0115 


GAUGAGAACUUCAAACUGA 


Cyclo 


86 


SEQ. ID 01 16 


UGAGAACUUCAAACUGAAG 


Cyclo 


87 


SEQ. ID 01 17 


AG AACUUCAAACUGAAGCA 


Cyclo 


88 


SEQ. ID 01 18 


AACUUCAAACUGAAGCACU 


Cyclo 


89 


SEQ. ID 01 19 


CUUCAAACUGAAGCACUAC 


Cyclo 


90 


SEQ. ID 0120 


UCAAACUGAAGCACUACGG 


DB 


1 


SEQ. ID 0121 


ACGGGCAAGGCCAAGUGGG 


DB 


2 


SEQ, ID 0122 


CGGGCAAGGCCAAGUGGGA 


DB 


3 


SEQ. ID 0123 


GGGCAAGGCCAAGUGGGAU 


DB 


4 


SEQ. ID 0124 


GGCAAGGCCAAGUGGGAUG 


DB 


5 


SEQ. ID 0125 


GCAAGGCCAAGUGGGAUGC 


DB 


6 


SEQ. ID 0126 


CAAGGCCAAGUGGGAUGCC 


DB 


7 


SEQ. ID 0127 


AAGGCCAAGUGGGAUGCCU 


DB 


8 


SEQ. ID 0128 


AGGCCAAGUGGGAUGCCUG 


DB 


9 


SEQ. ID 0129 


GGCCAAGUGGGAUGCCUGG 


DB 


10 


SEQ. ID 0130 


GCCAAGUGGGAUGCCUGGA 


DB 


11 


SEQ. ID 0131 


CCAAGUGGGAUGCCUGGAA 


DB 


12 


SEQ. ID 0132 


CAAGUGGGAUGCCUGGAAU 


DB 


13 


SEQ. ID 0133 


AAGUGGGAUGCCUGGAAUG 


DB 


14 


SEQ. ID 0134 


AGUGGGAUGCCUGGAAUGA 


DB 


15 


SEQ. ID 0135 


GUGGGAUGCCUGGAAUGAG 


DB 


16 


SEQ. ED 0136 


UGGGAUGCCUGGAAUGAGC 


DB 


17 


SEQ. ID 0137 


GGGAUGCCUGGAAUGAGCU 


DB 


18 


SEQ. ID 0138 


GGAUGCCUGGAAUGAGCUG 


DB 


19 


SEQ. ID 0139 


GAUGCCUGGAAUGAGCUGA 


DB 


20 


SEQ. ID 0140 


AUGCCUGGAAUGAGCUGAA 


DB 


21 


SEQ. ID 0141 


UGCCUGGAAUGAGCUGAAA 


DB 


22 


SEQ. ID 0142 


GCCUGGAAUGAGCUGAAAG 


DB 


23 


SEQ. ID 0143 


CCUGGAAUGAGCUGAAAGG 
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DB 


24 


SEQ.ID0144 


DB 


25 


SEQ.ID0145 


DB 


26 


SEQ.ID0146 


DB 


27 


SEQ.ID0147 


DB 


28 


SEQ. ID 0148 


DB 


29 


SEQ. ID 0149 


DB 


30 


SEQ. ID 0150 


DB 


31 


SEQ. ID 0151 


DB 


32 


SEQ. ID 0152 


DB 


33 


SEQ. ID 0153 


DB 


34 


SEQ. ED 0154 


DB 


35 


SEQ. ID 0155 


DB 


36 


SEQ. ID 0156 


DB 


. 37. 


SEQ. ID 0157 


DB 


38 


SEQ. ED 0158 


DB 


39 


SEQ. ID 0159 


DB 


40 


SEQ. ED 0160 


DB 


41 


SEQ. ED 0161 


DB 


42 


SEQ. ED 0162 


DB 


43 


SEQ. ID 0163 


DB 


44 


SEQ, ID 0164 


DB 


45 


SEQ. ID 0165 


DB 


46 


SEQ. ID 0166 


DB 


47 


SEQ. ID 0167 


DB 


48 


SEQ, ID 0168 


DB 


49 


SEQ. ID 0169 


DB 


50 


SEQ. ID 0170 


DB 


51 


SEQ. ED 0171 


DB 


52 


SEQ. ID 0172 


DB 


53 


SEQ. ID 0173 


DB 


54 


SEQ. ID 0174 


DB 


55 


SEQ. ED 0175 


DB 


56 


SEQ. BD 0176 


DB 


57 


SEQ. ID 0177 


DB 


58 


SEQ. ID 0178 


DB 


59 


SEQ. ID 0179 


DB 


60 


SEQ, ID 0180 


DB 


61 


SEQ. BD 0181 


DB 


62 


SEQ. ID 0182 


DB 


63 


SEQ. ID 0183 



CUGGAAUGAGCUGAAAGGG 

UGGAAUGAGCUGAAAGGGA 

GGAAUGAGCUGAAAGGGAC 

GAAUGAGCUGAAAGGGACU 

AAUGAGCUGAAAGGGACUU 

AUGAGCUGAAAGGGACUUC 

UGAGCUGAAAGGGACUUCC 

GAGCUGAAAGGGACUUCCA 

AGCUGAAAGGGACUUCCAA 

GCUGAAAGGGACUUCCAAG 

CUGAAAGGGACUUCCAAGG 

UGAAAGGGACUUCCAAGGA 

GAAAGGGACUUCCAAGGAA 

AAAGGGACUUCCAAGGAAG 

AAGGGACUUCCAAGGAAGA 

AGGGACUUCCAAGGAAGAU 

GGGACUUCCAAGGAAGAUG 

GGAGUUCCAAGGAAGAUGC 

GACUUCCAAGGAAGAUGCC 

ACUUCCAAGGAAGAUGCCA 

CUUCCAAGGAAGAUGCCAU 

UUCCAAGGAAGAUGCCAUG 

UCCAAGGAAGAUGCCAUGA 

CCAAGGAAGAUGCCAUGAA 

CAAGGAAGAUGCCAUGAAA 

AAGGAAGAUGCCAUGAAAG 

AGGAAGAUGCCAUGAAAGC 

GGAAGAUGCCAUGAAAGCU 

GAAGAUGCCAUGAAAGCUU 

AAGAUGCCAUGAAAGCUUA 

AGAUGCCAUGAAAGCUUAC 

GAUGCCAUGAAAGCUUACA 

AUGCCAUGAAAGCUUACAU 

UGCCAUGAAAGCUUACAUC 

GCCAUGAAAGCUUACAUCA 

CCAUGAAAGCUUACAUCAA 

CAUGAAAGCUUACAUCAAC 

AUGAAAGCUUACAUCAACA 

UGAAAGCUUACAUCAACAA 

GAAAGCUUACAUCAACAAA 



DB 


64 


SEQ.ID0184 


DB 


65 


SEQ. ID 0185 


DB 


66 


SEQ. ID 0186 


DB 


67 


SEQ. ID 0187 


DB 


68 


SEQ. ID 0188 


DB 


69 


SEQ. ID 0189 


DB 


70 


SEQ. ID 0190 


DB 


71 


SEQ. ID 0191 


DB 


72 


SEQ. m 0192 


DB 


73 


SEQ. ID 0193 


DB 


74 


SEQ. ID 0194 


DB 


75 


SEQ. ID 0195 


DB 


76 


SEQ. ED 0196 




77 


SEQ. ID.0197 


DB 


78 


SEQ. ID 0198 


DB 


79 


SEQ. ID 0199 


DB 


80 


SEQ. ID 0200 


DB 


81 


SEQ. ID 0201 


DB 


82 


SEQ. ID 0202 


DB 


83 


SEQ. ID 0203 


DB 


84 


SEQ. ID 0204 


DB 


85 


SEQ. ID 0205 


DB 


86 


SEQ. ID 0206 


DB 


87 


SEQ. ID 0207 


DB 


88 


SEQ. ID 0208 


DB 


89 


SEQ. ID 0209 


DB 


90 


SEQ. ID 0210 


Lac 


1 


SEQ. ID 0211 


Luc 


2 


SEQ. ID 0212 


Luc 


3 


SEQ. ID 0213 


Luc 


4 


SEQ. ID 0214 


Luc 


5 


SEQ. ID 0215 


Luc 


6 


SEQ. ID 0216 


Luc 


7 


SEQ. ID 0217 


Luc 


8 


SEQ. ID 0218 


Luc 


9 


SEQ. ID 02 19 


Luc 


10 


SEQ. ID 0220 


Luc 


11 


SEQ. ID 0221 


Luc 


12 


SEQ. ID 0222 


Luc 


13 


SEQ. ID 0223 



AAAGCUUACAUCAACAAAG 

AAGCUUACAUCAACAAAGU 

AGCUUACAUCAACAAAGUA 

GCXJUACAUCAACAAAGUAG 

CUUACAUCAACAAAGUAGA 

UUACAUCAACAAAGUAGAA 

UACAUCAACAAAGUAGAAG 

ACAUCAACAAAGUAGAAGA 

CAUCAACAAAGUAGAAGAG 

AUCAACAAAGUAGAAGAGC 

UCAACAAAGUAGAAGAGCU 

CAACAAAGUAGAAGAGCUA 

AACAAAGUAGAAGAGCUAA 

ACAAAGUAGAAGAGCUAAA 

CAAAGUAGAAGAGCUAAAG 

AAAGUAGAAGAGCUAAAGA 

AAGUAGAAGAGCUAAAGAA 

AGUAGAAGAGCUAAAGAAA 

GUAGAAGAGCUAAAGAAAA 

UAGAAGAGCUAAAGAAAAA 

AGAAGAGCUAAAGAAAAAA 

GAAGAGCUAAAGAAAAAAU 

AAGAGCUAAAGAAAAAAUA 

agagcuaaagaaaaaauac 

gagcuaaagaaaaaauacg 
agcuaaagaaaaaauacgg 

GCUAAAGAAAAAAUACGGG 
AUCCUCAUAAAGGCCAAGA 
AGAUCCUCAUAAAGGCCAA 
AGAGAUCCUCAUAAAGGCC 
AGAGAGAUCCUCAUAAAGG 
UCAGAGAGAUCCUCAUAAA 
AAUCAGAGAGAUCCUCAUA 
AAAAUCAGAGAGAUCCUCA 
GAAAAAUCAGAGAGAUCCU 
AAGAAAAAUCAGAGAGAUC 
GCAAGAAAAAUCAGAGAGA 
ACGCAAGAAAAAUCAGAGA 
CGACGCAAGAAAAAUCAGA 
CUCGACGCAAGAAAAAUCA 



Luc 


14 


SEQ.ro 0224 


AACUCGACGCAAGAAAAAU 


Luc 


15 


SEQ. ro 0225 


AAAACUCGACGCAAGAAAA 


Luc 


16 


SEQ. ro 0226 


GGAAAACUCGACGCAAGAA 


Luc 


17 


SEQ. ro 0227 


CCGGAAAACUCGACGCAAG 


Luc 


18 


SEQ. ro 0228 


UACCGGAAAACUCGACGCA 


Luc 


19 


SEQ, ro 0229 


CUUACCGGAAAACUCGACG 


Luc 


20 


SEQ. ro 0230 


GUCUUACCGGAAAACUCGA 


Luc 


21 


SEQ.ro 0231 


AGGUCUUACCGGAAAACUC 


Luc 


22 


SEQ, ro 0232 


AAAGGUCUUACCGGAAAAC 


Luc 


23 


SEQ, ro 0233 


CGAAAGGUCUUACCGGAAA 


Luc 


24 


SEQ. ro 0234 


ACCGAAAGGUCUUACCGGA 


Luc 


25 


SEQ. ro 0235 


GUACCGAAAGGUCUUACCG 


Luc 


26 


SEQ. ro 0236 


AAGUACCGAAAGGUCUUAC 


Luc 


27 


SEQ. ro 0237 - 


CGAAGUACCGAAAGGUCUU 


Luc 


28 


SEQ. ro 0238 


GACGAAGUACCGAAAGGUC 


Luc 


29 


SEQ. ro 0239 


UGGACGAAGUACCGAAAGG 


Luc 


30 


SEQ. ro 0240 


UGUGGACGAAGUACCGAAA 


Luc 


31 


SEQ. ro 0241 


' UUUGUGGACGAAGUACCGA 


Luc 


32 


SEQ.ro 0242 


UGUUUGUGGACGAAGUACC 


Luc 


33 


SEQ. ro 0243 


UGUGUUUGUGGACGAAGUA 


Luc 


34 


SEQ. ro 0244 


GUUGUGUUUGUGGACGAAG 


Luc 


35 


SEQ.ro 0245 


GAGUUGUGUUUGUGGACGA 


Luc 


36 


SEQ.ro 0246 


AGGAGUUGUGUUUGUGGAC 


Luc 


37 


SEQ. ro 0247 


GGAGGAQUUGUGUUUGUGG 


Luc 


38 


SEQ. ro 0248 


GCGGAGGAGUUGUGUUUGU 


Luc 


39 


SEQ. ro 0249 


GCGCGGAGGAGUUGUGUUU 


Luc 


40 


SEQ. ro 0250 


UUGCGCGGAGGAGUUGUGU 


Luc 


41 


SEQ.ro 0251 


AGUUGCGCGGAGGAGUUGU 


Luc 


42 


SEQ. ro 0252 


AAAGUUGCGCGGAGGAGUU 


Luc 


43 


SEQ. ro 0253 


AAAAAGUUGCGCGGAGGAG 


Luc 


44 


SEQ. ro 0254 


CGAAAAAGUUGCGCGGAGG 


Luc 


45 


SEQ. ro 0255 


CGCGAAAAAGUUGCGCGGA 


Luc 


46 


SEQ. ro 0256 


ACCGCGAAAAAGUUGCGCG 


Luc 


47 


SEQ. ro 0257 


CAACCGCGAAAAAGUUGCG 


Luc 


48 


SEQ. ro 0258 


AACAACCGCGAAAAAGUUG 


Luc 


49 


SEQ. ro 0259 


GUAACAACCGCGAAAAAGU 


Luc 


50 


SEQ. ro 0260 


AAGUAACAACCGCGAAAAA 


Luc 


51 


SEQ. ro 0261 


UCAAGUAACAACCGCGAAA 


Luc 


52 


SEQ. ro 0262 


AGUCAAGUAACAACCGCGA 


Luc 


53 


SEQ. ro 0263 


CCAGUCAAGUAACAACCGC 
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Luc 


54 


SEQ. ID 0264 


Luc 


55 


SEQ. ID 0265 


Luc 


56 


SEQ. ID 0266 


Luc 


57 


SEQ. ID 0267 


Luc 


58 


SEQ. ID 0268 


Luc 


59 


SEQ. ID 0269 


Luc 


60 


SEQ. ED 0270 


Luc 


61 


SEQ. ID 0271 


Luc 


62 


SEQ. ID 0272 


Luc 


63 


SEQ. ID 0273 


Luc 


64 


SEQ.ro 0274 


Luc 


65 


SEQ.ro 0275 


Luc 


66 


SEQ. ro 0276 


Luc 


67 


SEQ. ro 0277 


Luc 


68 


SEQ. ro 0278 


Luc 


69 


SEQ. ro 0279 


Luc 


70 


SEQ. ro 0280 


Luc 


71 


SEQ.ro 0281 


Luc 


72 


SEQ.ro 0282 


Luc 


73 


SEQ.ro 0283 


Luc 


74 


SEQ.ro 0284 


Luc 


75 


SEQ. ro 0285 


Luc 


76 


SEQ.ro 0286 


Luc 


77 


SEQ.ro 0287 


Luc 


78 


SEQ. ID 0288 


Luc 


79 


SEQ. ro 0289 


Luc 


80 


SEQ. ro 0290 


Luc 


81 


SEQ.ro 0291 


Luc 


82 


SEQ. ro 0292 


Luc 


83 


SEQ. ro 0293 


Luc 


84 


SEQ.ro 0294 


Luc 


85 


SEQ. ro 0295 


Luc 


86 


SEQ. ro 0296 


Luc 


87 


SEQ.ro 0297 


Luc 


88 


SEQ.ro 0298 


Luc 


89 


SEQ. ro 0299 


Luc 


90 


SEQ.ro 0300 



CGCCAGUCAAGUAACAACC 

GUCGCCAGUCAAGUAACAA 

ACGUCGCCAGUCAAGUAAC 

UUACGUCGCCAGUCAAGUA 

GAUUACGUCGCCAGUCAAG 

UGGAUUACGUCGCCAGUCA 

CGUGGAUUACGUCGCCAGU 

AUCGUGGAUUACGUCGCCA 

AGAUCGUGGAUUACGUCGC 

AGAGAUCGUGGAUUACGUC 

AAAGAGAUCGUGGAUUACG 

AAAAAGAGAUCGUGGAXJUA 

GGAAAAAGAGAUCGUGGAU 

ACGGAAAAAGAGAUCGUGG 

UGACGGAAAAAGAGAUCGU 

GAUGACGGAAAAAGAGAUC 

ACGAUGACGGAAAAAGAGA 

AGACGAUGACGGAAAAAGA 

AAAGACGAUGACGGAAAAA 

GGAAAGACGAUGACGGAAA 

ACGGAAAGACGAUGACGGA 

GCACGGAAAGACGAUGACG 

GAGCACGGAAAGACGAUGA 

UGGAGCACGGAAAGACGAU 

UUUGGAGCACGGAAAGACG 

GUUUUGGAGCACGGAAAGA 

UUGUUUUGGAGCACGGAAA 

UGUUGUUUUGGAGCACGGA 

GUUGUUGUUUUGGAGCACG 

CCGUUGUUGUUUUGGAGCA 

CGCCGUUGUUGUUUUGGAG 

GCCGCCGUUGUUGUUUUGG 

CCGCCGCCGUUGUUGUUUU 

UCCCGCCGCCGUUGUUGUU 

CUUCCCGCCGCCGUUGUUG 

AACUUCCCGCCGCCGUUGU 

UGAACUUCCCGCCGCCGUU 
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Example II. Validation of the Algorithm using DBI, Luciferase, PLK, EGFR, 
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes, human DBI, 
firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human secreted 

5 alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of 
their SMARTscores™ derived by analysis of their sequence using Formula VIII (all 
of the siRNAs would be selected with Formula IX as well) and analyzed for their 
ability to silence their targets' expression. In addition to the scoring, a BLAST search 
. was conducted for each siRNA. To minimize the potential for off-target silencing 

1 0 effects, only those target sequences with more than three mismatches against xm- 

related sequences were selected. Semizarov, et al, Specificity of short interfering UNA 
determined through gene expression signatures. Proc. Natl. Acad. Sci. U.S.A. 2003, 
100:6347. These duplexes were analyzed individually and in pools of 4 and 
compared with several siRNAs that were randomly selected. The functionality was 

1 5 measured a percentage of targeted gene knockdown as compared to controls. All 

siRNAs were transfected as described by the methods above at 1 00 nM concentration 
into HEK293 using Lipofectamine 2000. The level of the targeted gene expression 
was evaluated by B-DNA as described above and normalized to the non-specific 
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed herein 

20 were significantly more potent than randomly selected siRNAs. The algorithm 

increased the chances of identifying an F50 siRNA firom 48% to 91%, and an F80 
siRNA firom 13% to 57%. In addition, pools of SMART siRNA silence the selected 
target bettor than randomly selected pools (see Figure 1 OF). 

25 Example Uh Validation of the Algorithm Using Genes Involved in Clathrln- 
Dependent Endocytosis. 

Components of clathrin-mediated endocytosis pathway are key to modulating 
intracellular signaling and play important roles in disease. Chromosomal 
rearrangements that result in fusion transcripts between the Mixed-Lineage Leukemia 
30 gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene) are 
believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, 
as well as HIPl (Huntingtin-interacting protein), genes that are believed to be 
involved in endocytosis, are potentially responsible for ailments resulting in lipid 
storage, and neuronal diseases, respectively. For these reasons, siRNA directed 
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against clathrin and other genes involved in the clathrin-mediated endocytotic 
pathway are potentially important research and therapeutic tools. 

siRNAs directed against genes involved in the clathrin-mediated endocytosis 
5 pathways were selected using Foraiula VIII. The targeted genes were clathrin heavy 
chain (CHC, accession # NM_004859), clathrin light chain A (CLCa, NM_001833), 
clathrin light chain B (CLCb, NM_001834), CALM (U45976), p2 subunit of AP-2 
(p2, NM_001282), Epsl5 (NM__001981), EpslSR (NM_021235), dynamin II 
(DYNII, NM_004945), RabSa (BC001267), RabSb (NM_002868), RabSc 
10 (AF141304), and EEA,1 (XM_018197). 

For each gene, four siRNAs duplexes with the highest scores were selected 
and a BLAST search was conducted for each of them using the Human EST database. 
In order to minimize the potential for off-target silencing effects, only those 
1 5 sequences with more than three mismatches against im-related sequences were used. 
All duplexes were synthesized at Dharmacon, lac. as 21-mers with 3'-UU overhangs 
using a modified method of 2'- ACE chemistry Scaringe, Advanced 5'-sifyl'2'- 
orthoester approach to UNA oligonucleotide synthesis, Methods Enzymol 2000, 3 17:3 
and the antisense strand was chemically phosphorylated to insure maximized activity. 

20 

HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) 
containing 10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were 
resuspended in IX siRNA Universal buffer (Dharmacoii-Inc.) to 20p.M prior to 
transfection. HeLa cells in 12-well plates were transfected twice with 4^1 of 20^M 

25 siRNA duplex in 3^1 Lipofectamine 2000 reagent (Invitrogen, Carlsbad, California, 
USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were 
included, the amount of each duplex was decreased, so that the total amount was the 
same as in transfections with single siRNAs. Cells were plated into normal culture 
medium 1 2 hours prior to experiments, and protein levels were measured 2 or 4 days 

30 after the first transfection. 

Equal amounts of lysates were resolved by electrophoresis, blotted, and 
stained with the antibody specific to targeted protein, as well as antibodies specific to 
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unrelated proteins, PPl phosphatase and TsglOl (not shown). The cells were lysed in 
Triton X-lOO/glycerol solubilization buffer as described previously. Tebar, 
Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) 
Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the 
5 Impact ofOverexpression on Clathrin-mediated Traffic, Mol. Biol. Cell Aug 1999, 
10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, 
and Western blotting was performed with several antibodies followed by detection 
using enhanced chemiluminescence system (Pierce, Inc). Several x-ray films were 
analyzed to determine the linear range of the chemiluminescence signals, and the 
10 quantifications were performed using densitometry and Alphalmager v5.5 software 
(Alpha Innotech Corporation). In experiments with EpslSR-targeted siRNAs, cell 
lysates were subjected to immunoprecipitation with Ab860, and EpslSR was detected 
in immunoprecipitates by Western blotting as described above. 

1 5 The antibodies to assess the levels of each protein by Western blot were 

obtained fi-om the following sources: monoclonal antibody to clathrin heavy chain 
(TD.l) was obtained fi-om American Type Culture Collection (Rockville, MD, USA); 
polyclonal antibody to dynamin II was obtained firom Affinity Bioreagents, Inc. 
(Golden, CO, USA); monoclonal antibodies to EEA.l and RabSa were purchased 

20 firom BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal 

antibody to TsglOl was purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, 
CA, USA); the monoclonal antibody to GFP was from ZYMED Laboratories Inc. 
(South San Francisco, CA, USA); the rabbit polyclonal.aixtibodies Ab32 specific to a- 
adaptins and Ab20 to CALM were described previously Sorkin, et al, Stoichiometric 

25 Interaction of the Epidermal Growth Factor Receptor with the Clathrin-associated 
Protein Complex AP-2, J. Biol. Chem. Jan 1995, 270:619, the polyclonal antibodies to 
clathrin light chains A and B were kindly provided by Dn F. Brodsky (UCSF); 
monoclonal antibodies to PPl (BD Transduction Laboratories) and a-Actinin 
(Chemicon) were kindly provided by Dr. M. DeirAcqua (University of Colorado); 

30 Epsl 5 Ab577 and Epsl5R Ab860 were kindly provided by Dr. P.P. Di Fiore 
(European Cancer Institute). 
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Figure 11 demonstrates the in vivo functionality of 48 individual siRNAs, 
selected using Formula VIII (most of them will meet the criteria incorporated by 
Formula DC as well) targeting 12 genes. Various cell lines were transfected with 
siRNA duplexes (Pupl-4) or pools of siRNA duplexes (Pool), and the cells were 
5 lysed 3 days after transfection with the exception of CALM (2 days) and P2 (4 days). 

Note a pl-adaptin band (part of AP-1 Golgi adaptor complex) that runs 
shghtly slower than P2 adaptin. CALM has two splice variants, 66 and 72 kD. The 
fulWength EpslSR (a doublet of -130 kD) and several truncated spliced forms of - 

10 1 00 kD and -70 kD were detected in Epsl 5R immunoprecipitates (shown by arrows). 
The cells were lysed 3 days after transfection. Equal amoxints of lysates were 
resolved by electrophoresis and blotted with the antibody specific to a targeted protein 
(GFP antibody for YFP fusion proteins) and the antibody specific to unrelated 
proteins PPl phosphatase or a-actinin, and TSGIOI. The amount of protein in each 

1 5 specific band was normalized to the amount of non-specific proteins in each lane of 
the gel Nearly all of them appear to be fimctional, which establishes that Formula 
VIII and IX can be used to predict siRNAs' fimctionality in general in a genome wide 
manner. 

20 To generate the fiision of yellow fluorescent protein (YFP) with RabSb or 

RabSc (YFP-Rab5b or YFP-Rab5c), a DNA firagment encoding the fiill-length human 
RabSb or RabSc was obtained by PGR using Pfu polymerase (Stratagene) with a Sad 
restriction site introduced into the 5' end and a Kpnl sitfeMnf^fee 3' end and cloned 
into pEYFP-Cl vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and YFP- 

25 RabSa were described previously Tebar, Bohlander, & Sorkin, Clathrin Assembly 

Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, 
Interactions with Clathrin, and the Impact of Overexpression on Clathrin-mediated 
Traffic, Mol. BioL Cell Aug 1999, 10:2687. 

30 Example in. VaUdation of the Algorithm Using EgS, GADPH, ATEl, MEK2, 
MEKl, QB, LaminA/C, c-myc, human cydophilin, and mouse cyclophilin. 
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A number of genes have been identified as playing potentially important roles 
in disease etiology. Expression profiles of normal and diseased kidneys has 
implicated Edg5 in inununoglobulin A neuropathy, a common renal glomerular 
disease. Mycl, MEKl/2 and other related kinases have been associated with one or 
5 more cancers, while lamins have been implicated in muscular dystrophy and other 
diseases. For these reasons, siRNA directed against the genes encoding these classes 
of molecules would be important research and therapeutic tools. 

Figure 12 illustrates four siRNAs targeting 10 different genes (Table Vfor 
10 sequence and accession number information) that were selected according to the 
Formula VIII and assayed as individuals and pools in HEK293 cells. The level of 
siRNA induced silencing was measured using the B-DNA assay. These studies 
demonstrated that thirty-six out of the forty individual SMART-selected siRNA tested 
are fimctional (90%) and all 10 pools are fiiUy fiinctional. 

15 

Example V. Validation of the Algorithm Using Bcl2 

Bcl-2 is a -25kD, 205-239 amino acid, anti-apoptotic protein that contains 
considerable homology with other members of the BCL family including BCLX, 
MCLl, BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which 

20 has a hydrophobic t^l for membrane anchorage, and Bcl2b, which lacks the 

hydrophobic tail) and is predominantly localized to the mitochondrial membrane. 
While Bcl2 expression is widely distributed, particular interest has focused on the 
expression of this molecule in B and T cells. Bcl2 expressiQn.is.idio^n-regulated in 
normal germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 

25 expression has been observed to be elevated. Cytological studies have identified a 
common translocation ((14;1 8)(q32;q32)) amongst a high percentage (>70%) of these 
lymphomas. This genetic lesion places the Bcl2 gene in juxtaposition to 
immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to 
enforce inappropriate levels of gene expression, and resistance to programmed cell 

30 death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 

promoter leads to enhanced expression and again, inhibition of apoptosis. In addition 
to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis 
and various neurological diseases. 
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The correlation between Bcl-2 translocation and cancer makes this gene an 
attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript 
(or Bcl2-IgH fusions) would further our understanding Bcl2 gene function and 
possibly provide a future therapeutic agent to battle diseases that result fi"om altered 
5 expression or function of this gene. 

In Silico Identification of Functional siRNA 

To identify functional and hyperfunctionai siRNA against the Bcl2 gene, the 
sequence for Bcl-2 was downloaded firom the NCBI Unigene database and analyzed 
10 using the Formula VIII algorithm. As a result of these procedures, both the sequence 
and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according to their 
functionality. Subsequently, these sequences were BLAST' ed (database) to insure that 
the selected sequences were specific and contained minimal overlap with unrealated 
genes. The SMARTscores™ for the top 10 Bcl-2 siRNA are identified in Figure 13. 

15 

In Vivo Testing of Bcl-2 SiRNA 

Bcl-2 siRNAs having the top ten SMARTscores'^^ were selected and tested in 
a functional assay to determine silencing efficiency. To accomplish this, each of the 
ten duplexes were synthesized using 2' -O- ACE chemistry and transfected at lOOnM 
20 concentrations into cells,; Twenty-four hours later assays were performed on cell 
extracts to assess the degree of target silencing. Controls used in these experiments 
included mock transfected cells, and cells that were transfected with a non-specific 
siRNA duplex. , ^ 

25 The results of these experiments are presented below (and in Figure 14) and 

show tiiat all ten of the selected siRNA induce 80% or better silencing of the Bcl2 
message at lOOnM concentrations. These data verify that the algorithm successfully 
identified functional Bcl2 siRNA and provide a set of functional agents that can be 
used in experimental and therapeutic environments. 

30 
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siRNA 1 GGGAGAUAGUGAUGAAGUA 
siRNA2 GAAGUACAUCCAUUAUAAG 
siRNA 3 GUACGACAACCGGGAGAUA 
. siRNA 4 AGAUAGUGAUGAAGUACAU 

SiRNA 5 UGAAGACUCUGCUCAGUUU 
SiRNA 6 GCAUGCGGCCUCUGUUUGA 
SiRNA? UGCGGCCUCUGUUUGAUUU 
siRNA 8 GAGAUAGUGAUGAAGUACA 
siRNA 9 GGAGAUAGUGAUGAAGUAC 
siRNA 10 GAAGACUCUGCUCAGUUUG 

10 Bcl2 SiRNA: Sense Strand, 5'->3' 

Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Formulas (Algorithms) VIII and EX 
1 5 with their corresponding ranking, which have been evaluated for the silencing activity 
in vivo in the present study (Formula VIII and DC, re^ectively). 



TABLE V 



Gene 


Accession 






Formula 


Formula 


Name 


Number 


SEQ. ID NO, 


FTUSeqTence 


VUI 


IX 


CLTC 


NM_004859 


SEQ.IDNO. 0301 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


CLTC 


NM_004859 


SEQ. ID NO. 0302 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


NM__^004859 


SEQ.IDNO. 0303 


TGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM_004859 


SEQ.IDNO. 0304 


GGAAATGGATCTCTTTGAA, 


54 


49.4 


CLTA 


NM_.001833 


SEQ.IDNO. 0305 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NM__001833 


SEQ.IDNO. 0306 


AGACAGTTATGCAGCTATT 


4 


22.9 


CLTA 


NM_001833 


SEQ. ID NO. 0307 


CCAATTCTCGGAAGCAAGA 


1 


17 


CLTA 


NM__001833 


SEQ. ID NO. 0308 


GAAAGTAATGGTCCAACAG 


-1 


-13 


CLTB 


NM_001834 


SEQ. ID NO. 0309 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_^001834 


SEQ.IDNO. 0310 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NM__001834 


SEQ.IDNO. 0311 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_001834 


SEQ.IDNO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ.IDNO. 0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ.IDNO. 0314 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ.IDNO. 0315 


GGAAATGGAACCACTAAGA 


30 


61.2 


CALM 


U45976 


SEQ.IDNO. 0316 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS15 


NM_001981 


SEQ.IDNO. 0317 


CCACCAAGATTTCATGATA 


48 


25.2 



SEQ.IDNO. 301 
SEQ. ID NO. 302 
SEQ.IDNO. 303 
SEQ. ID NO. 304 
SEQ. ID NO. 305 
SEQ. ID NO. 306 
SEQ.IDNO. 307 
SEQ. ID NO. 308 
SEQ. ID NO. 309 
SEQ. ID NO. 310 



EPS15 


NM_001981 


SEQ.IDNO. Q318 


EPS 15 


NM_001981 


SEQ.IDNO. 0319 


EPS 15 


NM_001981 


SEQ.IDNO. 0320 


EPS15R 


NM_021235 


SEQ. ID NO. 0321 


EPS15R 


NM_^021235 


SEQ.IDNO. 0322 


EPS15R 


NM_021235 


SEQ.IDNO. 0323 


EPS15R 


NM_^021235 


SEQ.IDNO. 0324 


DNM2 


NM_004945 


SEQ. ID NO. 0325 


DNM2 


NM_004945 


SEQ.IDNO. 0326 


DNM2 


NM_004945 


SEQ.IDNO. 0327 


DNM2 


NM_004945 


SEQ.IDNO. 0328 


ARF6 


AF93885 


SEQ.IDNO. 0329 


ARF6 


AF93885 


SEQ. ID NO. 0330 


ARF6 


AF93885 


SEQ.IDNO. 0331 


ARF6 


AF93885 


SEQ.IDNO. 0332 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


RAB5A 


BC001267 


SEQ. ID NO. 0334 


RAB5A 


BC001267 


SEQ.IDNO. 0335 


RAB5A 


BC001267 


SEQ.IDNO. 0336 


RAB5B 


1SIM_002868 


SEQ.IDNO. 0337 


RAB5B 


NM_002868 


SEQ.IDNO. 0338 


RAB5B 


NM_002868 


SEQ.IDNO. 0339 


RAB5B 


NM_002868 


SEQ. ID NO. 0340 


RAB5C 


AF141304 


SEQ. ©NO. 0341 


RAB5C 


AF141304 


SEQ.IDNO. 0342 


RAB5C 


AF141304 


SEQ.IDNO. 0343 


RAB5C 


AF141304 


SEQ.IDNO. 0344 


EEAl 


XM_018197 


SEQ.IDNO. 0345 


EEAl 


XM_018197 


SEQ. ED NO. 0346 


EEAl 


XM_018197 


SEQ. ID NO. 0347 


EEAl 


XM_018197 


SEQ. ID NO. 0348 


AP2B1 


NM_001282 


SEQ.IDNO. 0349 


AP2B1 


NM_001282 


SEQ.IDNO. 0350 


AP2B1 


NM_001282 


SEQ.IDNO. 0351 


AP2B1 


NM_001282 


SEQ.IDNO. 0352 


PLK 


NM_005030 


SEQ.IDNO. 0353 


PLK 


NM__005030 


SEQ. ID NO. 0354 


PLK 


NM_005030 


SEQ.IDNO. 0355 


PLK 


NM_005030 


SEQ.IDNO. 0356 


GAPDH 


NM_002046 


SEQ.IDNO. 0357 



GATCGGAACTCCAACAAGA 


43 


49.3 


AAACGGAGCTACAGATTAT 


39 


11.5 


CCACACAGCATTCTTGTAA 


33 


-23.6 


GAAGTTACCTTGAGCAATC 


48 


33 


GGACTTGGCCGATCCAGAA 


27 


33 


GCACTTGGATCGAGATGAG 


20 


1.3 


CAAAGACCAATTCGCGTTA 


17 


27.7 


CCGAATCAATCGCATCTTC 


6 


-29.6 


GACATGATCCTGCAGTTCA 


5 


-14 


GAGCGAATCGTCACCACTT 


5 


24 


CCTCCGAQCTGGCGTCTAC 


-4 


-63.6 


TCACATGGTTAACCTCTAA 


27 


-21.1 


GATGAGGGACGCCATAATC 


7 


-38.4 


CCTCTAACTACAAATCTTA 


4 


16.9 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


GCAAGCAAGTCCTAACATT 


40 


25.1 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


AGGAATCAGTGTTGTAGTA 


16 


1 1.5 


GAAGAGGAGTAGACCTTAC 


12 


7 


GAAAGTCAAGCCTGGTATT 


14 


18.1 


AAAGTCAAGCCTGGTATTA 


6 


-17.8 


GCTATGAACGTGAATGATC 


3 


-21.1 


CAAGCCTGGTATTACGTTT 


-7. 


-37.5 


GGAACAAGATCTGTCAATT 


38 


51.9 


GCAATGAACGTGAACGAAA 


29 


43.7 


CAATGAACGTGAACGAAAT 


18 


43.3 


GGACAGGAGCGGTATCACA 


6 


18.2 


AGACAGAGCTTGAGAATAA 


67 


64.1 


GAGAAGATCTTTATGCAAA 


60 


48.7 


GAAGAGAAATCAGCAGATA 


58 


45.7 


GCAAGTAACTCAACTAACA 


56 


72.3 


GAGCTAATCTGCCACATTG 


49 


-12.4 


GCAGATGAGTTACTAGAAA 


44 


48.9 


CAACTTAATTGTCCAGAAA 


41 


28.2 


CAACACAGGATTCTGATAA 


33 


-5.8 


AGATTGTGCCTAAGTCTCT 






ATGAAGATCTGGAGGTGAA 


0 


-4.3 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


AGATCACCCTCCTTAAATA 


15 


72.3 


CAACGGATTTGGTCGTATT 


27 


-2.8 
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GAPDH 


NM_002046 


SEQ.IDNO. 0358 


GAAATCCCATCACCATCTT 


24 


3.9 


GAPDH 


NM_002046 


SEQ.IDNO. 0359 


GACCTCAACTACATGGTTT 


22 


-22.9 


GAPDH 


NM__002046 


SEQ.IDNO. 0360 


TGGTTTACATGTTCCAATA 


9 


9,8 


c-Myc 




SEQ.IDNO. 0361 


GAAGAAATCGATGTTGTTT 


31 


-11.7 


c-Myc 




SEQ.IDNO. 0362 


ACACAAACTTGAACAGCTA 


22 


51.3 


c-Myc 




SEQ.IDNO. 0363 


GGAAGAAATCGATGTTGTT 


18 


26 


C"Myc 




SEQ.IDNO. 0364 


GAAACGACGAGAACAGTTG 


18 


-8.9 


MAP2K1 


>JM_002755 


SEQ.IDNO. 0365 


GCACATGGATGGAGGTTCT 


26 


16 


MAP2K1 


NM_002755 


SEQ.IDNO. 0366 


GCAGAGAGAGCAGATTTGA 


16 


0.4 


MAP2K1 


NM__002755 


SEQ.IDNO. 0367 


GAGGTTCrCTGGATCAAGT 


14 


15.5 


MAP2K1 


NM_002755 


SEQ.IDNO. 0368 


GAGCAGATTTGAAGCAACT 


14 


18.5 


MAP2K2 


NM_030662 


SEQ.IDNO. 0369 


CAAAGACGATGACTTCGAA 


37 


26.4 


MAP2K2 


NM_030662 


SEQ.IDNO. 0370 


GATCAGCATTTGCATGGAA 


24 


-0.7 


MAP2K2 


NM__030662 


SEQ.IDNO. 0371 


TCCAGGAGTTTGTCAATAA 


17 


-4.5 


MAP2K2 


NM_030662 


SEQ.IDNO. 0372 


GGAAGCTGATCCACCTTGA 


16 


59.2 


KNSL1{EG5) 


NM_^004523 


SEQ. ID NO. 0373 


GCAGAAATCTAAGGATATA 


53 


35.8 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0374 


CAACAAGGATGAAGTCTAT 


50 


18.3 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0375 


CAGCAGAAATCTAAGGATA 


41 


32.7 


KNSL1(EG5) 


NM_004523 


SEQ.IDNO. 0376 


CTAGATGGCTTTCTCAGTA 


39 


3.9 


CyclophilmA_ 


NM__021130 


SEQ.IDNO. 0377 


AGACAAGGTCCCAAAGACA 


-16 


58.1 


CyclophilinA_ 


NM_021130 


SEQ.IDNO. 0378 


GGAATGGCAAGACCAGCAA 


-6 


36 


CyclophilinA_ 


NM_021130 


SEQ.IDNO. 0379 


AGAATTATTCCAGGGTTTA 


-3 


16.1 


CyclophilinA_ 


NM_^021130 


SEQ.IDNO. 0380 


GCAGACAAGGTCCCAAAGA 


8 


8.9 


LAMINA/C 


NM_170707 


SEQ.IDNO. 0381 


AGAAGCAGCTTCAGGATGA 


31 


38.8 


LAMINA/C 


NM_170707 


SEQ.IDNO. 0382 


GAGCTTGACTTCCAGAAGA 


33 


22.4 


LAMINA/C 


NM_170707 


SEQ.IDNO. 0383 


CCACCGAAGTTCACCCTAA 


21 


27.5 


LAMINA/C 


NM_170707 


SEQ. ID NO. 0384 


GAGAAGAGCTCCTCCATCA 


55 


30.1 


CyclophilinB 


M60857 


SEQ.IDNO. 0385 


GAAAGAGCATCTACGGTGA 


41 


83.9 


CyclophilmB 


M60857 


SEQ.IDNO. 0386 


GAAAGGATTTGGCTACAAA 


53 


59.1 


CyclophilinB 


M60857 


SEQ.IDNO. 0387 


ACAGCAAATTCCATCGTGT 


-20 


28.8 


CyclophilinB 


M60857 


SEQ.IDNO. 0388 


GGAAAGACTGTTCCAAAAA 


2 


27 


DBIl 


NM_020548 


SEQ. ID NO. 0389 


CAACACGCCTCATCCTCTA 


27 


-7.6 


DBI2 


NM_020548 


SEQ.IDNO. 0390 


CATGAAAGCTTACATCAAC 


25 


-30.8 


DBI3 


NM_020548 


SEQ.IDNO. 0391 


AAGATGCCATGAAAGCTTA 


17 


22 


DBl4 


NM_020548 


SEQ.IDNO. 0392 


GCACATACCGCCTGAGTCT 


15 


3.9 


rLUCl 




SEQ.IDNO. 0393 


GATCAAATCTGAAGAAGGA 


57 


49.2 


rLUC2 




SEQ. ID NO. 0394 


GCCAAGAAGTTTCCTAATA 


50 


13.7 


rLUC3 




SEQ. ID NO. 0395 


CAGCATATCTTGAACCATT 


41 


-2.2 


rLUC4 




SEQ. ID NO. 0396 


GAACAAAGGAAACGGATGA 


39 


29.2 


SeAPl 


NM_031313 


SEQ.IDNO. 0397 


CGGAAACGGTCCAGGCTAT 


6 


26.9 



72 



SeAP2 


NM_^031313 


SEQ.IDNO. 0398 


GCTTCGAGCAGACATGATA 


4 


-11.2 


SeAP3 


NM__031313 


SEQ. ID NO. 0399 


CCTACACGGTCCTCCTATA 


4 


4.9 


SeAP4 


NM_031313 


SEQ.nDNO. 0400 


GCCAAGAACCTCATCATCT 


1 


-9.9 


fLUCl 




SEQ. ID NO. 0401 


GATATGGGCTGAATACAAA 


54 


40.4 


fLUC2 




SEQ. ID NO. 0402 


GCACTCTGATTGACAAATA 


47 


54.7 


fLUC3 




SEQ. ED NO. 0403 


TGAAGTCTCTGATTAAGTA 


46 


34.5 


fLUC4 




SEQ.IDNO. 0404 


TCAGAGAGATCCTCATAAA 


40 


11.4 


mCyclo_l 


NM^008907 


SEQ.IDNO. 0405 


GCAAGAAGATCACCATTTC 


52 


46.4 


inCyclo_2 


NM_008907 


SEQ.IDNO. 0406 


GAGAGAAATTTGAGGATGA 


36 


70.7 


mCyclo_3 


NM_008907 . 


SEQ.IDNO. 0407 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


inCyclo_4 


NM_008907 


SEQ. ID NO. 0408 


GAAAGAAGGCATGAACATT 


27 


10.3 


BCL2_1 


NM_000633 


SEQ.IDNO. 0409 


GGGAGATAGTGATGAAGTA 


21 


72 


BCL2_2 


NM_000633 


SEQ. ID NO. 0410 


GAAGTACATCCATTATAAG 


1 


3.3 


BCL2_3 


NM_0p0633 


SEQ. ID NO. 0411 


GTACGACAACCGGGAGATA 


1 


35.9 


BCL2_4 


NM_000633 


SEQ.IDNO. 0412 


AGATAGTGATGAAGTACAT 


-12 


22.1 


BCL2__5 


NM_000633 


SEQ.IDNO. 0413 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


BCL2_6 


NM_000633 


SEQ.IDNO. 0414 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


QBl 


NM_003365.1 


SEQ. ID NO. 0415 


GCACACAGCUUACUACAUC 


52 


-4.8 


QB2 


NM_^003365.1 


SEQ. ID NO. 0416 


GAAAUGCCCUGGUAUCUCA 


49 


22.1 


QB3 


NM_003365.1 


SEQ.IDNO. 0417 


GAAGGAACGUGAUGUGAUC 


34 


22.9 


QB4 


NM_003365.1 


SEQ.IDNO. 0418 


GCACUACUCCUGUGUGUGA 


28 


20.4 


ATEl-1 


NM_007041 


SEQ.IDNO. 0419 


GAACCCAGCUGGAGAACUU 


45 


15.5 


ATE1.2 


NM_^007041 


SEQ. ID NO. 0420 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


ATEl-3 


NM_007041 


SEQ. ID NO. 0421 


GUACUACGAUCCUGAUUAU 


37 


32.9 


ATEl^ 


NM_007041 


SEQ.IDNO. 0422 


GUGCCGACCUUUACAAUUU 


35 


18.2 


EGFR-1 


NM_005228 


SEQ.IDNO. 0423 


GAAGGAAACTGAATTCAAA 


68 


79.4 


EGFR-1 


NM__005228 


SEQ.IDNO. 0424 


GGAAATATGTACTACGAAA 


49 


49,5 


EGFR-1 


NM_005228 


SEQ. ID NO. 0425 


CCACAAAGCAGTGAATTTA 


41 


7.6 


EGFR-1 


NM_005228 


SEQ.IDNO. 0426 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Example VII. Genome-Wide Application of the Algorithm 

The examples described above demonstrate that the algorithm(s) can 
5 successfully identify functional siRNA and that these duplexes can be used to induce 
the desirable phenotype of transcriptional knockdown or knockout. Each gene or 
family of genes in each organism plays an important role in maintaining physiological 
homeostasis and the algorithm can be used to develop functional, highly fanctional, or 
hyperfunctional siRNA to each gene. To accomplish this for the human genome, the 
10 entire onhne ncbi refseq database was accessed through Entrez (efetch). The database 
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was processed through Formula VIII. For each gene the top 80 -1 00 scores for 
siRNAs were obtained and BLAST'ed to insure that the selected sequences are 
specific in targeting the gene of choice. These sequences are provided on the 
enclosed CDs in electronic form. 

5 

With respect to the disks, there are four tables on each disk copy in text 
format: Tables XII --XV. Table XII, which is located in a file entitled Table_12.txt, 
provides a list of the 80-100 sequences for each target, identified by Formula VIII as 
having the highest relative SMARTscores™ for the target analyzed. Table XIII, 

10 which is located in a file entitled Table_13.txt, provides the SMARTscores™, an^ 
each gene, a pool pick of up to four sequences is denoted. (The denotation of "1" in 
Table XIII means that it is a pool pick.) These pool pick sequences represent the most 
fimctional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick 
sequences could be used for gene silencing. Further, sequences that are not denoted 

15 as pool pick sequences, but that are included on the compact disks may also be used 
for gene silencing either alone or in combination with other sequences. However, 
their individual relative functionality would be less than that of a pool pick sequence. 
Table XIV, which is located in a file entitled Table_14.txt, provides an identification 
of genes by accession number, and Table XV, which is located in a file entitled 

20 Table_l 5.txt, provides a short name for th?^ genes identified on the disk. The 
information contained on the disks is part of this patent application and are 
incorporated into the specification by reference. One may use these tables in order to 
identify functional siRNAs for the gene provided therein, by simply looking for the 
gene of interest and an siRNA that is listed as functional. Preferably, one would 

25 select one or more of the siRNA that most optimized for the target of interest and is 
denoted as a pool pick. 



Table XII: siRNA selected by Formula VIII 

See data submitted herewith on a CD-ROM in accordance with PCT 
30 Administrative Instructions Section 80 1 (a) 

Table XHI: SMARTscores™ 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 
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Table XIV: Identification of Targets 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

5 

Table XV: Description of Targts 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

1 0 Many of the genes to which the described siRNA are directed play critical 

roles in disease etiology. For this reason, the siRNA listed in the accompanying 
compact disk may potentially act as therapeutic agents. A number of prophetic 
examples follow and should be understood in view of the siRNA that are identified on 
the accompanying CD. To isolate these siRNA, the appropriate message sequence for 

15 each gene is analyzed using one of the before mentioned formulas (preferably formula 
VIII) to identify potential siRNA targets. Subsequently these targets are BLAST'ed to 
eliminate homology with potentially off-targets. 

The list of potential disease targets is extensive. For instance, over-expression 
20 of BcUO has been implicated in the development of MALT lymphoma (mucosa 

associated lymphoid tissue lymphoma) and thus, functional, highly functional, or 

hyperfimctional siRNA directed against that gene {e.g. SEQ. ID NO. 0427: 

GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428: 

GAAAGAACCUUGCCGAUCA; SEQ. ID NO. 0429: 
25 GGAAAUACAUCAGAGCUUA, or SEQ. ID NO. 0430: 

GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder. 

In another example, studies have shown that molecules that inhibit 
glutamine:fiiictose-6-phosphate aminotransferase (GFA) may act to limit the 
30 symptoms suffered by Type II diabetics. Thus, functional, highly functional, or 

hyperfimctional siRNA directed against GFA (also known as GFPTl : siRNA = SEQ. 
ID NO. 0433 UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434 
GAAGUUACCUCUUACAUUU; SEQ. ID NO. 0435 
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GUACGAAACUGUAUGAUUA; SEQ. ID NO. 0436 

GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder. 

In another example, the von Hippel-Lihdau (VHL) tumor suppressor has been 

5 observed to be inactivated at a high frequency in sporadic clear cell renal cell 
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor 
suppressor targets hypoxia-inducible factor- 1 alpha (HIF-1 alpha), a transcription 
factor that can induce vascular endothelial growth factor (VEGF) expression, for 
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of 

10 HIF-1 alpha, and subsequent VEGF over expression. Such over expression of VEGF 
has been used to explain the increased (and possibly necessary) vascularity observed 
in RCC. Thus, functional, highly functional/ or hyperfiinctional siRNAs directed 
against either HIF-1 alpha (SEQ. ID NO. 0437 GAAGGAACCUGAUGCUUUA; 
SEQ. ID NO. 0438 GCAUAUAUCUAGAAGGUAU; SEQ. ID NO. 0439 

15 GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440 

GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441 
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442 
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443 
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444 

20 CGAGGCAGCUUGAGUUAAA) may be usefixUn the treatment of renal cell 
carcinoma. 

■ . In another example, gene expression of platelet derived growth factor A and B 
(PDGF-A and PDGFrB) has been observed to be increased 22- and 6-fold, 

25 respectively, in renal tissues taken from patients with diabetic nephropathy as 

compared with controls. These findings suggest that over expression of PDGF A and 
B may play a role in the development of the progressive fibrosis that characterizes 
human diabetic kidney disease. Thus, functional, highly functional, or hyperfunctional 
siRNAs directed against either PDGF A 

30 (SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA; 
SEQ. ID NO. 0446: CCGCAAAUAUGCAGAAUUA; 
SEQ. ID NO. 0447: GGAUGUACAUGGCGUGUUA; 
SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or 



PDGFB 

(SEQ. ID NO. 0449: CCGAGGAGCUUUAUGAGAU; 
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU; 
SEQ. ID NO. 0451 GAGCAGGAAUGGUGAGAUG; 
5 SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU; 
SEQ. ID NO. 0453 CCGAGGAGCUUUAUGAGAU; 

SEQ. ID NO. 0454 UUUAUGAGAUGCUGAGUGA) may be useful in the treatment 
of this form of kidney disorder. 

10 In another example, a strong correlation exists between the over-expression of 

glucose transporters (e.g. GLUT12) and cancer cells. It is predicted that cells 

undergoing uncontrolled cell growth up'regulate GLUT molecules so that they can 

cope with the heightened aiergy needs associated with increased rates of proUferation 

and metastasis. Thus, siRNA-based therapies that target the molecules such as 
1 5 GLUTl (also known as SLC2A1 : siRNA= 

SEQ. ID NO.: 0455 GCAAUGAUGUCCAGAAGAA; 

SEQ. ID NO.: 0456 GAAGAAUAUUCAGGACUUA; 

SEQ. ID NO.: 0457 GAAGAGAGUCGGCAGAUGA; 

SEQ. ID NO.: 0458 CCAAGAGUGUGCUAAAGAA) 
20 .^^r 

GLUT12 (also known as SLCA12: siRNA = 

SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA; 

SEQ. ID NO. 0460: GAAAUGAUGUGGAUAAGAG; 

SEQ. ID NO. 0461 : GAUCAAAUCCUCCCUGAAA; 
25 SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU) and other related transporters, 

may be of value in treating a multitude of malignancies. 

The siRNA sequences listed above are presented in a 5'-> 3' sense strand 
direction. In addition, siRNA directed against the targets listed above as well as those 
30 directed against other targets and listed in the accompanying compact disk may be 
useful as therapeutic agents. 

Example VIII. Evidence for the Benefits of Pooling 
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Evidence for the benefits of pooling have been demonstrated using the 
reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacon 
proprietary ACE® chemistry against one of the standard reporter genes: firefly 
luciferase. The duplexes were designed to start two base pairs apart and to cover 
5 approximately 1 80 base pairs of the luciferase gene (see sequences in Table III). 
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression 
reporter plasmid into HEK293 cells using standard transfection protocols and 
luciferase activity was assayed at 24 and 48 hours. 



1 0 Transfection of individual siRN As showed standard distribution of inhibitory 

effect. Some duplexes were active, while others were not. Figure 15 represents a 
typical screen of ninety siRNA^duplexes (SEQ. ID NO. 0032- 0120) positioned two 
base pairs apart. As the figure suggests, the functionality of the siRNA duplex is 
determined more by a particular sequence of the oligonucleotide than by the relative 

1 5 oligonucleotide position within a gene or excessively sensitive part of the mRNA, 
which is important for traditional anti-sense technology. 

When two continuous oligonucleotides were pooled together, a significant 
increase in gene silencing activity was observed. (See Figure 16) A gradual increase 
20 in efficacy and the fi-equency of pools functionality was observed when the number of 
siRNAs increased to 3 and 4. (Figures 16, 17). Further, the relative positioning of 
the oligonucleotides within a pool did not determine whether a particular pool was 
functional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2, 
10 and 20 base pairs were functional). 

25 

However, relative positioning may nonetheless have an impact. An increased 
functionality may exist when the siRNA are positioned continuously head to toe (5' 
end of one directly adjacent to the 3' end of the others). 



Additionally, siRNA pools that were tested performed at least as well as the 
best oligonucleotide in the pool, under the experimental conditions whose results are 
depicted in Figure 19. Moreover, when previously identified non-functional and 
marginally (semi) functional siRNA duplexes were pooled together in groups of five 
at a time, a significant ftinctional cooperative action was observed. (See Figure 20) 
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In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than 
the most potent oligonucleotide in the pool Therefore, pooling several siRNA 
duplexes together does not interfere with the functionality of the most potent siRNAs 
within a pool, and pooling provides an unexpected significant increase in overall 
5 functionality 

Example IX. PooliBg Across Species 

Experiments were performed on the following genes: p-galactosidase, Renilla 
luciferase, and Secreted alkaline phosphatase, which demonstrates the benefits of 
1 0 pooling, (see Figure 21) Approximately 50% of individual siRNAs designed to 
silence the above-specified genes were functional, while 100% of the pools that 
contain the same siRNA^dtip'lex^ were functional. 

Example X. Highly Functional siRNA 

1 5 Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted 

in 98% fiinctional entities (>80% silencing). Pools of siRNAs distributed throughout 
the mRNA that were evenly spaced, covering an approximate 20 - 2000 base pair 
range, were also functional. When the pools of siRNA were positioned continuously 
head to tail relative to mRNA sequences and mimicked the natural products of Dicer 

20 cleaved long double stranded RNA, 98% of the pools evidenced highly functional 
activity (>95% silencing). 

Example XI; Buman cyclophyline 

Table III above Usts the siRNA sequences for the human cyclophyline 
25 protein. A particularly functional siRNA may be selected by applying these 
sequences to any of Formula I to VII above. 

Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a 
kit for silencing a gene. Preferably, within the kit there would be at least one 
30 sequence that has a relatively high predicted functionality when any of Formulas I - 
VII is applied. 

Example XII* Sample Pools of siRNAs and Their Application to Human Disease 
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The genetic basis behind human disease is well documented and siRNA may 
be used as both research or diagnostic tools and thorapeutic agents, either individually 
or in pools. Genes involved in signal transduction, the immune response, apoptosis, 
DNA repair, cell cycle control, and a variety of other physiological ftinctions have 
5 clinical relevance and therapeutic agents that can modulate expression of these genes 
may alleviate some or all of the associated symptoms. In some instances, these genes 
can be described as a member of a family or class of genes and siRNA (randomly, 
conventionally, or rationally designed) can be directed against one or multiple 
members of the family to induce a desired result. 

10 

To identify rationally designed siRNA to each gene, the sequence was 
analyzed using Formula^¥III to identify a SMARTpool containing the functional 
sequences. To confirm the activity of these sequences, flie siRNA are introduced into 
a cell type of choice (e.g. HeLa cells, HEK293 cells) and the levels of the appropriate 

1 5 message are analyzed using one of several art proven techniques, SiRNA having 

heightened levels of potency can be identified by testing each of the before mentioned 
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased 
levels of longevity can be identified by introducing each duplex into cells and testing 
fimctionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfection. Agents 

20 that induce >95% silencing at sub-nanomolar concentrations and/or induce fiinctional 
levels of silencing for >96 hours are considered hyperfimctional. 

The folkg^Wg are non-limiting examples of families of proteins to which 
siRNA described in this document are targeted against: 

25 

Transporters, Pumps, and Channels 

Transporters, pumps, and channels represent one class of genes that are 
30 attractive targets for siRNAs. One major class of transporter molecules are the ATP- 
binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter 
genes have been characterized and have been shown to be involved in a variety of 
physiological ftinctions including transport of bile salts, nucleosides, chloride ions, 
cholesterol, toxins, and more. Predominant among this group are MDRl (which 
35 encodes the P-glycoprotein, NP_00091 8), the MDR-related proteins (MRPl-7), and 
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the breast cancer resistance protein (BCRP). In general, these transporters share a 
conmion structure, with each protein containing a pair of ATP-binding domains (also 
known as nucleotide binding folds, NBF) and two sets of transmembrane (TM) 
domains, each of which typically contains six membrane-spanning a-helices. The 
5 genes encoding this class of transporter are organized as either full transporters (i.e, 
containing two TM and two NBF domains) or as half transporters that assemble as 
either homodimers or heterodimers to create functional transporters. As a whole, 
members of the family are widely dispersed throughout the genome and show a high 
degree of amino acid sequence identify among eukaryotes. 

10 

ABC-transporters have been implicated in several human diseases. For 
instance, molecM&f^^ffliix pumps of this type play a major role in the development of 
drug resistance exhibited by a variety of cancers and pathogenic microorganisms. In 
the case of human cancers, increased expression of the MDRl gene and related pumps 

15 have been observed to generate drug resistance to a broad collection of commonly 
used chemotherapeutics including doxorubicin, daimorubicin, vinblastine, vincristine, 
colchicines. In addition to the contribution these transporters make to the 
development of multi-drug resistance, there are currently 13 human genetic diseases 
associated with defects in 14 different transporters. The most common of these 

20 conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration, 
adrenoleukodystrophy, Tangier disease, Dubin-Johnson syndrome and progressive 
familial intrahepatic cholestasis. For this reason, siRNAs directed against members of 
this, and related j 'iaiaailiesi are potentially valuable research and therapeutic tools. 

25 With respect to channels, analysis of Drosophila mutants has enabled the 

initial molecular isolation and characterization of several distinct channels including 
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which 
encodes a voltage activated K"^ channel, slowpoke (Slo), a Ca^^ activated channel, 
and ether-a-go-go (Bag). The Bag family is further divided into three subfamilies: 

30 Bag, Elk (eag-like K channels), and Erg (Bag related genes). 

The Erg subfamily contains three separate family members (Brgl-3) that are 
distantly related to the sh family of voltage activated channels. Like sh, erg 
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polypetides contain the classic six membrane spanning architecture of channels 
(S1-S6) but differ in that each includes a segment associated with the C-terminal 
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD). 
Like many isolated ion channel mutants, erg mutants are temperature-sensitive 
5 paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in 
neurons and enhanced transmitter release at the neuromuscular junction. 

Initial studies on the tissue distribution of all three members of the erg 
subfamily show two general patterns of expression. Ergl and erg3 are broadly 

10 expressed throughout the nervous system and are observed in the heart, the superior 
mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, erg2 
shows a much-niore-restricted pattern of expression and is only observed in celiac 
ganglia and superior mesenteric ganglia. Similarly, the kinetic properties of the three 
erg potassium channels are not homogeneous. Ergl and erg2 channels are relatively 

15 slow activating delayed rectifiers whereas the erg3 current activates rapidly and then 
exhibits a predominantly transient component that decays to a sustained plateau. The 
current properties of all three channels are sensitive to methanesulfonanilides, 
suggesting a high degree of conservation in the pore structure of all three proteins. 

20 Recently, the erg family of channels has been implicated in human disease. 

Consistent with the observation that ergl is expressed in the heart, single strand 
conformation polymorphism and DNA sequence analyses have identified HERG 
(human ergl) mutaticffl§dll;?six. long-QT-syndrome (LQT) families, an inherited 
disorder that results in sudden death firom a ventricular tachyarrythmia. Thus siRNA 

25 directed against this group of molecules (e.g. KCNH 1 -8) will be of extreme 
therapeutic value. 

Another group of channels that are potential targets of siRNAs are 
the CLCA family that mediate a Ca^'**-activated CP conductance in a variety of 
30 tissues. To date, two bovine (bCLCl ; bCLCA2 (Lu-ECAM-1)), three mouse 

(mCLCAl ; mCLCA2; mCLCA3) and four human (hCLCAl ; hCLCA2; hCLCA3; 
hCLCA4) CLCA family members have been isolated and patch-clamp studies with 
transfected human embryonic kidney (HEK-293) cells have shown that bCLCAl, 
mCLCAl, and hCLCAl mediate a Ca^'^-activated CI" conductance that can be 
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inhibited by the anion channel blocker DIDS and the reducing agent dithiothreitol 
(DTT). 

The protein size, structure, and processing seem to be similar among different 
5 CLCA family members and has been studied in greatest detail for Lu-ECAM-1 . The 
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 130 kDa that is 
processed to a 90-kDa amino-terminal cleavage product and a group of 30- to 40-kDa 
glycoproteins that are glycosylation variants of a single polypeptide derived from its 
carboxy terminus. Both subunits are associated with the outer cell surface, but only 
1 0 the 90-kDa subunit is thought to be anchored to flie cell membrane via four 
transmembrane domains. 

Although the protein processing and function appear to be conserved among 
CLCA homologs, significant differences exist in their tissue expression patterns. For 

15 example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCAl 
is exclusively detected in the trachea, and hCLCAl is selectively expressed in a 
subset of human intestinal epithelial cells. Thus the emerging picture is that of a 
multigene family with members that are highly tissue specific, similar to the CIC 
family of voltage-gated CT channels. The human channel, hCLCA2, is particular 

20 interesting from a medical and pharmacological standpoint. CLCA2 is expressed on 
the Itiminal surface of lung vascular endothelia and serves as an adhesion molecule 
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization. 
Expression of this moleciirfiya^ionnal mammary epithelium is consistently lost in 
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover, 

25 re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in 
nude mice, implying that hCLC A2 acts as a tumour suppressor in breast cancer. For 
these reasons, siRNA directed against CLCA family members and related channels 
may prove to be valuable in research and therapeutic venues. 

30 Transporters Involved m Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft. In most synapses the signal is 
terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 
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terminals. This process is catalyzed by specific neurotransmitter transporters that are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 

Aminobutyric acid (GAB A) is the major inhibitory neurotransmitter in the 
central nervous system. The inhibitory action of GABA, mediated through GABAa/ 
GABAb receptors, and is regulated by GABA transporters (GATs), integral 
membrane proteins located perisynaptically on neurons and glia. So far four different 
carriers (GAT1-GAT4) have been cloned and their cellular distribution has been 
partly worked out. Comparative sequence analysis has revealed that GABA 
transporters are related to several other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric add transporters, monoamine transporters, amino 
acid transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane helices topology 
and relies upon the Na+/Cl- gradient for transport function. Transport rates are 
dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range. In 
addition, transporter function is bidirectional, and non- vesicular efflux of transmitter 
may contribute to ambient extracellular transmitter levels. 

Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter function can be^Ueiee^y a variety of initiating factors and signal 
transduction cascades. In general, this functional regulation occurs in two ways, 
either by changing the rate of transmitter flux through the transporter or by changing 
the number of functional transporters on the plasma membrane. A recurring theme in 
transporter regulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface. In general, this functional modulation 
occurs in part through activation of second messengers such as kinases, phosphatases, 
arachidonic acid, and pH. However, the mechanisms underlying transporter 
phosphorylation and transporter redistribution have yet to be fiiUy elucidated. 

GABA transporters play a pathophysiological role in a number of human 
diseases including temporal lobe epilepsy and are the targets of pharmacological 



interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
these reasons, siRNAs directed against members of this family of genes (including but 
not limited to SLCG6A1-12) may prove to be valuable research and therapeutic tools. 

Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
food, drugs, occupation, and environment. Exoretory organs such as kidney, liver, and 
intestine defaid the body against the potentially harmful effects of these compounds 
by transforming them into less active metaboUtes that are subsequently secreted fiom 
the system. 



Carrier-mediated transport of xenobiotics and their metabolites exist for the 
active secaretion of organic anions and cations. Both systems are characterized by a 
high clearance capacity and tremendous diversity of substances accepted, properties 
that result from the existence of multiple transportere with overlappmg substrate 
specificities. The class of organic anion transporters plays a critical role in the 
elimination of a large number of drugs {e.g., antibiotics, chemotherapeutics, diuretics, - 
nonsteroidal anti-inflammatory drugs, radiocontrast agents, cytostatics); drug 
metaboUtes (especially conjugation products with glutathione, glucuronide, glycine, 
sulfate, acetate); and toxicants.am4Afifemetabolites (e.^., mycotoxins, herbicides, 
plastidzers, glutathione .S-conjugates of polyhaloalkanes,pol5*aloalkenes, 
hydroquinones, aminophenols), many of whidi are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting 
molecules has grown tremendously. Uptake of organic anions (OA") across the 
basolateral membrane is mediated by the classic sodium-dependent organic anion 
transport system, which includes a-ketoglutarate (a-KG^~)/OA" exchange via the 
organic anion transporter (OATl) and sodium-ketoglutarate cotransport via the 
NaVdicarboxylate cotransporter (SDCT2). The organic anion transporting polypetide, 
Oatpl, and the kidney-specific OAT-Kl and 0AT-K2 are seen as potential molecules 
that mediate facilitated OA" efflux but could also be involved in reabsoiption via an 
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exchange mechanism. Lastly the PEPTl and PEPT2 mediate luminal uptake of 
peptide drugs, whereas CNTl and CNT2 are involved in reabsorption of nucleosides 

The organic anion-transporting polypq)tide 1 (Oatpl) is aNa*- and ATP- 
independent transporter originally cloned from rat liver. The tissue distribution and 
transport properties of the Oatpl gene product are complex. Oatpl is localized to the 
basolateral membrane of hepatocytes, and is found on the apical membrane of S3 
proximal tubules. Studies with transiently transfected cells {e.g. HeLa cells) have 
indicated that Oatpl mediates transport of a variety of molecules including 
taurocholate, estrone-3 -sulfate, aldosterone, Cortisol, and others. The observed uptake 
of taurocholate by Oatpl expressed inX. laevis oocytes is accompanied by efHux of 
GSHj suggesting that transport by this molecule may be glutathione dependent. 

Computer modeling suggests that members of the Oatp family are highly 
conserved, hydrophobic, and have 1 2 transmembrane domains. Decreases in 
expression of Oa^ family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical community. For these reasons, siRNAs directed against 
OAT family members (including but not hmited to SLC21 A2, 3, 6, 8, 9, 11, 12, 14^ 
15, and related transporters) are potentially useful as research and therapeutic tools. 

Nucleoside transporters. 

Nucleoside transporters pla$yfi^$;^les in physiology and pharmacology. 
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 
tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
control the extracellular concentration of adenosine in the vicinity of its cell surface 
receptors and regulate processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac arriiythmias, and nucleoside 
transport inhibitors such as dipyridamole, dilazep, and draflazine function as coronary 
vasodilators. 

hi mammals, plasma membrane transport of nucleosides is brought about by 
members of the concenfa-ative, Na*-dependent (CNT) and equilibrative, Na""- 
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independent (ENT) nucleoside transporter families. CNTs are expressed in a tissue- 
specific fashion; ENTs are present in most, possibly all, cell types and are responsible 
for the movement of hydrophilic nucleosides and nucleoside analogs down their 
concentration gradients. In addition, structure/function studies of ENT family 
5 members have predicted these molecules to contain eleven transmembrane helical 
segments with an amino terminus that is intracellular and a carboxyl terminus that is 
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and 
a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated 
the TM 3-6 region as playing a central role in solute recognition. The medical 

1 0 importance of the ENT family of proteins is broad. In humans adaiosine exerts a 
range of cardioprotective effects and inhibitors of ENTs are seen as being valuable in 
- alleviating a variety of cardio/cardiovascular aihnents. hi addition, responses to 
nucleoside analog drugs has beai observed to vary considerably amongst e.g. cancer 
patients. While some forms of drug resistance have been shown to be tied to the up- 

15 regulation of ABC-transporters ie.g. MDR4), resistance may also be the resuh of 
reduced drug uptake (i.e. reduced ENT expression). Thus, a clearer understanding of 
ENT transporters may aid in optimizing drug treatments for patients suffering a wide 
range of malignancies. For these reasons, siRNAs directed against this class of 
molecules (including SLC28A1-3, SLC29A1-4, and related molecules) may be useful 

20 -as therapeutic and research tools. 

Sulfate Transporters. 

All cells require inorganic sulfaleioiaflrmal fimction. Sulfete is the fourth 
most abundant anion in human plasma and is the major source of sulfur in many 
25 organisms. Sulfation of extracellular matrix proteins is critical for maintaining 
normal cartilage metabolism and sulfate is an important constituent of myelin 
membranes found in the brain 

Because sulfate is a hydrophilic anion that cannot passively cross the lipid 
30 bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux 
to ensure an optimal supply. To date, a variety of sulfate transporters have been 
identified in tissues from many origins. These include the renal sulfate transporters 
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate 
transporter (DTDST), the intestinal sulfate hansporter (DRA), and the erythrocyte 
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anion exchanger (AEl). Most, if not all, of these molecules contain the classic 12 
transmembrane spanning domain architecture commonly found amongst members of 
the anion transporter superfamily. 

Recently three different sulfate transporters have been associated with specific 
human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have 
been recognized as the disease genes mutated in diastrophic dysplasia, congenital 
chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively. DTDST is a 
particularly complex disorder. The gene encoding (his molecule maps to chromosome 
5q, and encodes two distinct transcripts due to alternative exon usage. In contrast to 
other sulfate transporters (e.g. Sat-1) anion movement by the DTDST protein is 
- markedly inhibited by either extracellular chloride or bicarbonate. Impaired function 
of the DTDST gene product leads to undwsulfation of proteoglycans and a complex 
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB, 
atelosteogenesis type II, and diastrophic dysplasia) widi clinical features including but 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities. 
Interestingly, while epidemiological studies have shown that the disease occurs in 
most populations, it is particularly prevalent in Finland owing to an apparent founder 
effect. For these reasons, siRNAs directed against this class of genes (including but 
not limited to SLC26A1 -9, and related molecules) may be potentially helpful in both 
tho-apeutic and research v^ues. 



Ion Exchangers .^..^aj^ttt^ 

Intracelliilar pH regulatory mechanisms are critical for the maintenance of 
counUess cellular processes. For instance, in muscle cells, contractile processes and 
metabolic reactions are influenced by pH. During periods of increased energy 
demands and ischemia, muscle cells produce large amounts of lactic acid that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 

Several different transport medianisms have evolved to mamtain a relatively 
constant intracellular pH. The relative contribution of each of these processes varies 
with cell type, the metabolic requirements of the cell, and the local environmental 
conditions. Intracellular pH regulatory processes that have been diaracterized 
functionally include but are not limited to the Na*/H* exchange, the Na(HC03)„ 
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cotransport, and the Na^-dependent and -independent CF/base exchangers. As 
bicarbonate and CO2 comprise the major pH buffer of biological fluids, sodium 
biocarbonate cotransporters (NBCs) are critical. Studies have shown that these 
molecules exist in numerous tissues including the kidney, brain, liver, comea, heart, 
5 and lung, suggesting that NBCs play an important role in mediating HCOa"" transport 
in both epithelial as well as nonepithelial cells. 

Recent molecular cloning experiments have identified the existence of four 
NBC isoforms (NBCl, 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE 

1 0 (Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The 

secondary structure analyses and hydropathy profile of this family predict them to be 
intrinsic membrane proteins with 12 putative transmembrane domains and several 
family members exhibit iV-linked glycosylation sites, protein kinases A and C, casein 
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential 

1 5 sites for myristylation and amidation. AE4 is a relatively recent addition to this 
family of proteins and shows between 30-48% homology with the other family 
members. When expressed in COS-7 cells andXenopus oocytes AE4 exhibits sodium- 
independent and DIDS-insensitive anion exchanger activity. Exchangers have been 
shown to be responsible for a variety of human diseases. For instance, mutations in 

20 three genes of the anion transporter family (SLC) are believed to cause known 
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3, 
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and 
goiter/deafeess syndrome (A4, pendrin). Moreever^utations in Na+/HC03 co- 
transporters have also been associated with various human maladies. For these 

25 reasons, siRNAs directed against tihiese sorts of genes (e.g. SLC4A4-10, and related 
genes) may be useful for therapeutic and research purposes. 

Receptors Involved in Synaptic Transmission 

In all vertebrates, fast inhibitory synaptic transmission is the result of the 
30 interaction between the neurotransmitters glycine (Gly) and y-aminobutyric acid 
(GABA) and their respective receptors. The strychnine-sensitive glycine receptor is 
especially important in that it acts in the mammalian spinal cord and brain stem and 
has a well-established role in the regulation of locomotor behavior. 
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Glycine receptors display signiiScant sequence homology to several other 
receptors including the nicotinic acetylcholine receptor, the aminobutyric acid 
receptor type A (GABAaR), and the serotonin receptor type 3 (5-HT3R) subunits. As 
5 members of the superfamily of ligand-gated ion channels, these polypeptides share 
common topological features. The glycine receptor is composed of two types of 
glycosylated integral membrane proteins (al-a4 and P) arranged in a pentameric 
suprastructure. The alpha subunit encodes a large extracellular, N-terminal domain 
that carries the structural determinants essential for agonist and antagonist binding, 
1 0 followed by four transmembrane spanning regions (TMl -TM4), with TM2 playing the 
critical role of forming the inner wall of the chloride channel. 

The density, location, and subunit composition of glycine neurotransmitter 
receptors changes over the coiirse of development. It has been observed that the 

1 5 amount of GlyR gene translation (assessed by the injection of developing rat cerebral 
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GABARs 
increases. In addition, the type and location of mRNAs coding for GlyR changes over 
the course of development. For instance in a study of the expression of alpha 1 and 
alpha 2 subunits in the rat, it was observed that (in embryonic periods El 1-18) the 

20 mantle zone "was scarce in the alpha 1 mRNA, but the geraiinal zone (matrix layer) at 
Ell-14 expressed higher levels of the message. At postnatal day 0 (PO), the alpha 1 
signals became manifested throughout the gray matter of the spinal cord. By contrast, 
the spinal tissues at PO exhibited the highest level8^ofialpha-2 mRNA, which 
decreased with the postnatal development. 

25 

In both, man and mouse mutant lines, mutations of GlyR subunit genes result 
in hereditary motor disorders characterized by exaggerated startle responses and 
increased muscle tone. Pathological alleles of the GlraJ gene are associated with the 
murine phenotypes oscillator (spcf^) and spasmodic (spd). Similarly, a mutant allele 
30 of Glrb has been found to underly the molecular pathology of the spastic mouse (spa). 
Resembling the situation in the mouse, a variety of GLRAl mutant alleles have been 
shown to be associated with the human neurological disorder hyperelq)lexia or startle 
disease. For these reasons, siRNA directed against glycine receptors (GLRAl-3, 
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GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors, 
and related neurotransmitter receptor molecules may be valuable therapeutic and 
research reagents. 

5 Proteases 

Kallikreins 

One important class of proteases are the kallikreins, serine endopeptidases that 
split peptide substrates preferentially on the C-terminal side of internal arginyl and 

10 lysyl residues. Kallikreins are generally divided into two distinct groups, plasma 
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of 
enzymes that have substantial similarities at both the gene and protein level. The 
genes encoding this group are frequendy found on a single chromosome, are 
organized in clusters, and are expressed in a broad range of tissues (e.g. pancreas, 

1 5 ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single 
gene (e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The 
gene encoding plasma kallikrein is expressed solely in the liver, contains 15 exons, 
and encodes a glycoprotein that is translated as a preprotein called prekallikrein. 

20 Kallikreins are believed to play an important role in a host of physiological 

events For instance, the immediate consequence of plasma prekallikrein activation is 
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation 
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of 
inflammatory responses. Similarly, plasma kallikrein promotes single-chain urokinase 

25 activation and subsequent plasminogen activation, events that are critical to blood 
coaggulation and woxmd healing. 

Disruptions in the function of kallikreins have been implicated in a variety of 
pathological processes including imbalances in renal function and inflammatory 
30 processes. For these reasons, siRNAs directed against this class of genes (e.g. KLKl- 
1 5) may prove valuable in both research and therapeutic settings. 
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ADAM Proteins 

The process of fertilization takes place in a series of discrete steps whereby the 
sperm interacts with, i) the cumulus cells and the hyaluronic acid extracellular matrix 
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida 
(ZP), and iii) the egg plasma membrane. During the course of these interactions, the 
"acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm, 
is induced, allowing the sperm to penetrate the ZP and gain access to the perivitelline 
space. This process exposes new portions of the sperm membrane, including the inner 
acrosomal membrane and the equatorial segment, regions of the sperm head that can 
participate in initial gamete membrane binding. 

The interactions of the ^mete plasma membranes appear to involve multiple 
ligands and receptors and are frequently compared to leukocyte-endothelial 
interactions. These interactions lead to a series of signal transduction events in the 
egg, known as collectively as egg activation and include the initiation of oscillations 
in intracellular calcium concentration, the exit from meiosis, the entry into the first 
embryonic mitosis, and the formation of a block to polyspermy via the release of ZP- 
modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not 
only adhere to each other but also go on to undergo manbrane fusion, making one 
cell (the zygote) from two. 

Studies on tfie process of sperm-egg interactions have identified a number of 
proteins that are CTudal for fertilization. One class of proteansj^calted the ADAM 
family (A Disintegrin And Metalloprotease), has been found to be important in 
spermatogenesis and fertilization, as well as various developmental systems including 
myogenesis and neurogenesis. Members of the family contain a disintegrin and 
metalloprotease domain (and therefore have (potentially) both cell adhesion and 
protease activities), as well as cysteine-rich regions, epidermal growth factor (EGF)- 
like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM 
gene family has 29 members and constituents are widely distributed in many tissues 
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 
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One of the best-studied members of the ADAM family is fertilin, a 
heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin 
beta. The fertilin beta gene (ADAM2) has been disrupted with a targeting gene 
construct corresponding to the exon encoding the fertilin beta disintegrin domain. 
Sperm from males homozygous for disruptions in this region exhibit defects in 
multiple facets of sperm ftmction including reduced levels of sperm transit from the 
uterus to the oviduct, reduced sperm-ZP binding, and reduced spenn-egg binding, all 
of which contribute to male infertility. 

Recentiy, four new ADAM family members (ADAM 24-27) have been 
isolated. The deduced amino add sequences show that all four contain the complete 
domain organization common to ADAM family members and Northern Blot analysis 
has shown all four to be specific to the testes. SiRNAs directed against this class of 
genes (e.g. ADAM2 and related proteins) may be useftil as research tools and 
therapeutics directed toward fertility and birfli confr-ol. 

Aminopeptidases 

Aminopeptidases are proteases that play critical roles in processes such as 
protein maturation, protein digestion in its terminal stage, regulation of hormone 
levels, selective or homeostatic protein turnover, and plasmid stabilization. These 
enzymes generally have broad substiate specificity, occur in several forms and play a 
major role in physiological homeostasis. For instance, the effects of bradykinin, 
angiotensin converting enzyme (ACE), and other vasoactive mokt^i^e muted by 
one of several peptidases that cleave the molecule at an internal position and eliminate 
its ability to bind its cognate recq)tor (e.g. for bradykinin, the B2-receptor). 

Among the enzymes that can cleave bradykinin is tfie membrane bound 
aminopqjtidase P, also referred to as aminoacylproline aminopeptidase, proline 
aminopeptidase; X-Pro aminopeptidase (eukaryote) and XPNPEP2. Aminopeptidase 
P is an aminoacylproline aminopeptidase specific for NH^-tenninal Xaa-proline 
bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical 
metal binding motifs found in other zinc metalloproteases, ii) has an active-site 
configuration similar to that of other members of Uie MG peptidase family, and iii) is 
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present in a variety of tissues including but not limited to the lung, kidney, brain, and 
intestine. 



Aminopeptidases play an important role in a diverse set of human diseases. 
5 Low plasma concentrations of aminopeptidase P are a potential predisposing factor 
for development of angio-oedana in pati«its treated with ACE inhibitors, and 
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of 
ilhiess including, but not limited to myocardial infarction. For these reasons, siRNAs 
directed against this family of proteins (including but not limited to XPNPEPl and 
1 0 related proteins) may be usefiil as research and therapeutic tools. 

Serine Proteases - - , 

One important class of proteases are the serine proteases. Serine proteases 
share a common catalytic triad of tiuree amino adds in their active site (serine 

1 5 (nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either 
esters or pqptide bonds utilizing mechanisms of covalent catalysis and preferential 
binding of the transition state. Based on the position of their introns serine proteases 
have been classified into a minimum of four groups including those in which 1) the 
g«ie has no introns interrupting the exon coding for the catalytic triad (e.g. the 

20 haptoglobin gene,); 2) eackgene contains an intron just downstream from the codon 
for the histidine residue at the active site, a second intron downstream from the exon 
containing the aspartic acid residue of the active site and a third intron just upstream 
from die exon containing the serine of the active site {e.g. trypsinogea^<«es^ 
chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns 

25 intenrupting the exons coding the catalytic region (e.g. complement factor B gene); 
and 4) the genes contain two introns resulting in a large exon that contains both the 
active site aspartatic add and serine residues {e.g. factor X, factor IX and protein C 
genes). 



Cytotoxic lymphocytes (e.g. CD8(+) cytotoxic T cells and natural killer cells) 
form the major defense of higher organisms against virus-infected and transformed 
cells. A key fimction of these cells is to detect and eliminate potentially harmful cells 
by indudng them to undergo apoptosis. This is achieved through two prindpal 
pathways, both of which require direct but transient contact between the killer cell and 
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its target. The first pathway involves ligation of TNF receptor-like molecules such as 
Fas/CD95 to their cognate ligands, and results in mobilization of conventional, 
programmed cell-death pathways centered on activation of pro-apoptotic caspases. 
The second mechanism consists of a pathway whereby the toxic contents of a 
5 specialized class of secretory vesicles are introduced into the target cell. Studies over 
the last two decades have identified the toxic components as Granzymes, a family of 
serine proteases that are expressed exclusively by cytotoxic T lymphocytes and 
natural killer (NK) cells. These agents are stored in specialized lytic granules and 
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an 
1 0 important role in apoptosis, granzymes can cleave proteins after acidic residues, 
especially aspartic acid, and induce apoptosis in the recipient cell. 

Granzymes have been grouped into three subfamilies according to substrate 
specificity. Members of the granzyme family that have enzymatic activity similar to 

15 the serine5)rotease chymotrypsin are encoded by a gene cluster termed the 'chymase 
locus\ Similarly, granzymes with trypsin-like specificities are encoded by the 
'tryptase locus*, and a third subfamily cleaves after unbranched hydrophobic residues, 
especially methionine, and are encoded by the Met-ase locus'. All granzymes are 
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal 

20 enzymatic activity subsequentiov the removal of an amino-terminal dipeptide. 

Granzymes have been found to be important in a nximber of important 
biological fimctions including defense against intracellular pathogens^^afk^v^rsus 
host reactions, the susceptibility to transplantable and spontaneous malignancies, 
25 lymphoid homeostasis, and the tendency toward auto-inmiune diseases. For these 
reasons, siRNAs directed against granszymes (e.g. GZMA, GZMB, GZMH, GZHK, 
GZMM) and related serine proteases may be usefijl research and therapeutic reagents. 

Kinases 

30 Protein Kinases (PKs) have been impHcated in a number of biological 

processes. Kinase molecules play a central role in modulating cellular physiology and 
developmental decisions, and have been implicated in a large list of human maladies 
including cancer, diabetes, and others. 
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During the course of the last three decades, over a hundred distinct protein 
kinases have been identified, all with presumed specific cellular fimctions.. A few of 
these enzymes have been isolated to sufficient purity to perform in vitro studies, but 
most ronain intractable due to the low abundance of these molecules in the cell. To 
5 countw this technical difficulty, a number of protein kinases have been isolated by 
molecular cloning strategies fliat utilize the conserved sequences of the catalytic 
domain to isolate closely related homologs. Alternatively, some kinases have been 
purified (and subsequently studied) based on their interactions with other molecules. 

10 p58 is a member of the p34cdc2-related supergene family and contains a large 

domain that is highly homologous to the cell division control kinase, cdc2. This new 
cell division control-related protein kinase was originally identified as a component of 
semipurified galactosyltransferase; thus, it has been denoted galactosyltransferase- 
associated protein kinase (GTA-kinase). GTA-kinase has been found to be expressed 

15 in bQth.adult and embryonic tissues and is known to phosphorylate a number of 

substrates, including histone HI , and casein. Interestingly enough, over expression of 
this molecule in CHO cells has shown that elevated levels of p58 result in a prolonged 
late telophase and an early Gl phase, thus hinting of an important role for GTA- 
kinase in cell cycle regulation. 

20 

Cvclin Dependent Kinases 

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
f^^u.'.—^,^ serine/threonine kinases that mediate many of the cell cycle transitions thatoccuT ; 

during duplication. Each of these Cdk catalytic subunits associates with a specific 
25 subset of regulatory subunits, tenned cyclins, to produce a distinct Cdk cyclin kinase 
complex that, in general, fimctions to execute a unique cell cycle event. 

Activation of the Cdk-cyclin kinases during cellular transitions is controlled by 
a variety of regulatory mechanisms. For theCdc2xyclin B complex, inhibition of 
30 kinase activity during S phase and G2 is accomplished by phosphorylation of two 
Cdc2 residues, Thr^^ and Tyr^^ which are positioned within the ATP-binding cleft. 
Phosphorylation of Thr"* and/or Tyr^^ suppresses the catalytic activity of the molecule 
by disrupting the orientation of the ATP present within this cleft. In contrast, the 
abrupt dq)hosphorylation of tiiese residues by the Cdc25 phosphatase results in the 
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rapid activation of Cdc2 cycliii B kinase activity and subsequent downstream mitotic 
events. While the exact details of this pathway have yet to be elucidated, it has been 
proposed that Thr^'^/Tyr^^ phosphorylation functions to permit a cell to attain a critical 
concentration of inactive Cdk cyclin complexes, which, upon activation, induces a 
5 rapid and complete cell cycle transition. Furthermore, there is evidence in 
mammahan cells that Thr^'^/Tyr^^ phosphorylation also functions to delay Cdk 
activation after DNA damage. 

The Schizosaccharomyces pombe weel gene product was the first kinase 
1 0 identified that is capable of phosphorylating Tyr*^ in Cdc2. Homologs of the Weel 
kinase have been subsequently identified and biochemically characterized firom a wide 
range of species including hxraian, mouse, fi"Og, Sacchdromyces cerevisiae, and 
Drosophila. Li vertebrate systems, where Thr*"^ in Cdc2 is also phosphorylated, the 
Weel kinase was capable of phosphorylating Cdc2 on Tyr*^, but not Thr^^, indicating 
1 5 that another kinase was responsible for Thr^"^ phosphorylation. This gene, Mytl 

kinase, was recently isolated fit)m the membrane fractions of Xenopus egg extracts 
and has been shown to be capable of phosphorylating Thr*"^ and, to a lessor extent, 
Tyr^^ in Cdc2. A human Mytl homolog displaying similar properties has been 
isolated, as well as a non-membrane-associated molecule with Thr^"* kinase activity. 
20 - 

In the past decade it has been shown that cancer can originate firom 
overexpression of positive regulators, such as cyclins, or fi^om underexpression of 
negative regulators (e.g. pl6 (INK4a), pl5 (INK4b), p21 (Cipl)). Inhibitors sucfras:..- .... 
Mytl are the focus of much cancer research because they are capable of controlling 

25 cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these 
reasons, siRNA directed against kinases and kinase inhibitors including but not 
limited to ABLl, ABL2, ACKl, ALK, AXL, BLK, BMX, BTK, C20orf64, CSFIR, 
SCK, DDRl, DDR2, DKFZp761P1010, EGFR, EPHAl, EPHA2, EPHA3, EPHA4, 
EPHA7, EPHA8, EPHBl, EPHB2, EPHB3, EPHB4. EPHB6, ERBB2, ERBB3, 

30 ERBB4, FER, FES, FGFRl , FGFR2, FGFR3, FGFR4, FGR, FLTl , FLT3, FLT4, 
FRK, FYN, HCK, IGFIR, INSR, ITK, JAKl, JAK2, JAK3, KDR, KIAA1079, KIT, 
LCK, LTK, LYN, MATK, MERTK, MET, MSTIR, MUSK, NTRKl, NTRK2, 
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, PTK9L, RET, 
RORl, ROR2, ROSl, RYK, SRC, SYK, TEC, TEK, TIE, TNKl, TXK, TYK2, 
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TYROS, YESl, and related proteins, may be useful for research and therapeutic 
purposes. 

G Protein Coupled Receptors 

5 

One important class of genes to which siRNAs can be directed are G-protein 
coupled receptors (GPCRs). GPCRs constitute a superfamily of seven transmembrane 
spanning proteins that respond to a diverse array of sensory and chemical stimuli, 
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a 
1 0 central role in cell proUferation, differentiation, and have been implicated in the 
etiology of disease. 



The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisioned as a simple linear model: 

15 activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma suburtits, both 
of which can activate or inhibit various downstream effector molecules. More 
specifically, activation of the GPCR induces a conformational change in the Ga 
subunit, causing GDP to be released and GTP to be bound in its place. The Ga and 

20 Gpy subunits then dissociate from the receptor and interact with a variety of effector 
molecules. For instance in the case of the Gs family, the primary function is to 
stimulate the intracellular messenger adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into the secondary messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway and the Gq family activates 

25 phospholipases C (PLC), which cleaves phosphatidylinositol 4,5, bisphosphate (PIP2) 
to generate inositol-l,4,5-phosphate (IPS) and diacylglycerol (DAG). 



More recently, studies have shown that the functions of GPCRs are not limited 
to their actions on G-proteins and that considerable cross-talk exists between this 
30 diverse group of receptor molecules and a second class of membrane bound proteins, 
the receptor tyrosine kinases (RTKs). A number of GPCRs such as endothelin-1, 
thrombin, bombesin, and dopamine receptors can activate MAPKs, a downstream 
effector of the RTK/Ras pathway. Interestingly, the interaction between these two 
famiUes is not unidirectional and RTKs can also modulate the activity of sigjialing 
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pathways traditionally thought to be controlled exclusively by ligands that couple to 
GPCRs. For instance, EGF, which normally activates the MAPK cascade via the 
EGF receptor can stimulate adenylate cyclase activity by activating Gas. 



5 There are dozens of members of the G Protein-Coupled Receptor family that 

have emerged as prominent drag targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLRl 

10 CMLl/ CMKLRl (Accession No. Q99788) is a member of the chemokine 

receptor family of GPCRs that may play a role in a number of diseases including 
those involved in inflammation and immunological responses {e.g. asthma, arthritis). 
For this reason, siRNA directed against this protein may prove to be important 
therapeutic reagents. 

15 

Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy that is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being implicated in JNCL, 

20 CLN3 (GPCR-like protein, Accession No. A57219) expression studies have shown 
that the CLN3 mRNA and protein are highly over-expressed in a number of cancers 
(e.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaries, breast, 
— " ^-'^ - and colon) suggesting a possible contribution of this gene to tumor growth. For this 
reason, siRNA directed against this protein may prove to be important therapeutic 

25 reagents. 



CLACR 

The calcitonin receptor (CTR/ CALCR, Accession No. NM__001742) belongs 
to "family B" of GPCRs which typically recognized regulatory peptides such as 
30 parathyroid hormone, secretin, glucagons and vasoactive intestinal polypeptide. 

Although the CT receptor typically binds to calcitonin (CT), a 32 amino acid peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
(Receptor Activity Modulating Protein) enables it to readily bind other members of 
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the calcitonin peptide family including amylin (AMY) and other CT gene-related 
peptides (e.^. aCGRP and pCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast mediated bone resorption and enhanced Ca^^ 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 
5 important role in a variety of processes as wide ranging as embryonic/foetal 

development and sperm fimction/physiology. In addition, studies have shown that 
patients witii particular CTR genotypes may be at higher risk to lose bone mass and 
that this GPCR may contribute to the formation of calcium oxalate urinary stones. For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents. 

10 

OXTR 

The human oxytocin receptor (OTR, OXTR) is a 3 89 amino acid polypeptide 
that exhibits the seven transmembrane domain structure and belongs to the Class-I 
(rhodopsin-type) family of G-protein coupled receptors. OTR is expressed in a wide 

1 5 variety of tissues tiiroughout development and mediates physiological changes ' 
through G(q) proteins and phospholipase C-beta, Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad list of duties. OT and OTR play a 
role in a host of sexual, maternal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning. In addition, it has been 

20 hypothesized that abnormalities in the fiinctionaliQ^ bf oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities including compulsive behavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 
diseases. For these reasons, siRlSTA directed against this gene (NM_000916) may play ^m^^^ 
an important role in combating OTR-associated illnesses. 

25 

EDG GPCRs 

Lysophosphatidic acid and other lipid-based hormones/growth factors induce 
their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a niraiber of human 
30 diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropathy (IgAN), researchers have observed an 
enhanced expression of EDG5 (NP_004221) suggesting a contribution of tfiis gene 
product in tiie development of IgAN. For that reasons, siRNA directed against Edg5 
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(NM_004230), Edg4 (NM_004720), Edg7 (Nm_012152) and related genes may play 
an important role in combating human disease. 

Gaies Involved in Chol esterol Sig n aling and Biosynthesis 

Studies on model genetic organisms such as Drosophila and C. elegans have 
led to the identification of a plethora of genes that are essential for early development. 
Mutational analysis and ectopic expression studies have allowed many of these genes 
to be grouped into discreet signal transduction pathways and have shown that these 
elements play critical roles in pattern formation and cell differentiation. Disruption of 
one or more of these genes during early stages of development frequaitly leads to 
birth defects whereas as alteration of gene function at later stages in life can result in 
tumorigenesis. 

One critical set of interactions known to exist in both invertebrates and 
vertebrates is the Sonic Hedgehog-Patched-GU pathway. Originally documented as a 
Drosophila segmentation mutant, several labs have recently identified himian and 
mouse orthologs of many of the pathways members and have successfiilly related 
disruptions in these genes to known diseases. Pathway activation is initiated with the 
secretion of Sonic hedgehog. There are three closely related members of the Shh 
family (Sonic hedgehog. Desert, and Lidian) with Shh bang flie most widely 
expressed form of the group. The Shh gene product is sea-eted as a small pro-signal 
molecule. To successfiilly initiate its developmental role, Shh is first cleaved, 
- wheceupon the N-terminal tiimcated fragment is covalently modified with cholesterol. 
The addition of the sterol moiety promotes the interaction between Shh and its 
cognate membrane bound receptor, Patched (Ptch). There are at least two isofonns of 
the Patched gene, Ptchl and Ptch2. Both isofomis contain a sterol-sensing domain 
(SSD); a roughly 1 80 amino acid cluster that is found in at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
traffic, signal transduction, cholesterol transport, and sterol homeostasis. In the 
absence of Shh, the Patched protein is a negative regulator of the pathway. In contrast, 
binding of Shh-cholesterol to the Patdied recq)tor releases the negative inhibition 
which that molecule enforces on a G-protein coupled recqptor known as Smoothened. 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a hio of transcription factors that belong to the Gli family. All three factors are 
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relatively large, contain a charactisristic C2-H2 zinc-finger pentamer, and recognize 
one of two consensus sequences (SEQ. ID NO. 0463 GACCACCCA or SEQ. ID NO. 
0464 GAACCACCCA). In the absence of Shh, Gli proteins are cleaved by the 
proteosome and the C-terminally truncated fragment translocates to the nucleus and 
5 acts as a dominant transcription repressor. In the presence of Shh-cholesterol, Gli 
repressor formation is inhibited and fiiU-iength Gli functions as a transcriptional 
activator. 

Sbh and other members of the Shh-PTCH-Gli pathway are expressed in a 
10 broad range of tissues (e.g. the notochord, the flooiplate of the neural tube, the brain, 
and the gut) at early stages in development. Not surprisingly, mutations that lead to 

altered protein expression or function have been shown to induce developinental 
abnormalities. Defects in the human Shh gene have been shown to cause 
holoprosencephaly, a midline defect that manifests itself as cleft lip or palate, CNS 

1 5 septation, and a wide range of other phenotypes. Interestingly, defects in cholesterol 
biosynthesis generate similar Shh-like disorders (e.g. Smith-Lemli-Opitz syndrome) 
suggesting that cholesterol modification of the Shh gene product is crucial for 
pathway function. Both the Patched and Smoothened genes have also been shown to 
be clinically relevant with Smoothened now being recognized as an oncogene that, 

20 like PTCH-1 and PTCH-2, is believed to be flie causative,=agent of sevwal forms of 
adult tumors. For these reasons, siRNA directed against Smoothened (SMO, 
NM_005631), Patched (PTCH, nm_000264), and additional genes that participate in 

choleste^-signaling, biosynthesis, and degradation, have potentially useful researdi . - 
and ther^utic applications. 

25 

Targeted Pathways . 

hi addition to targeting siRNA against one or more members of a family of 
proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against mranbers of a signal transduction pathway (e.g. the 
30 insulin pathway, including AKTl-3, CBL, CBLB, EIF4EBP1, FOXOIA, F0X03A, 
FRAPl, GSK3A, GSK3B, IGFl, IGFIR, INPP5D, INSR, IRSl, MLLT7, PDPKl, 
PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KAI, 
RPX6KA3, SGK, TSCl, TSC2, AND XPOl). an apoptotic pathway (CASP3,6,7,8,9, 
DSHl/2, PI 1 0, P85, PDKl/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
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BCL2, SMAC, and others), pathways, involved in DNA damage, cell cycle, and other 
physiological (p53,MDM2, CHKl/2, BRCAl/2, ATM, ATR, P15INK4, P27, P21, 
SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, Inos, Mtor, FKBP, PPAR, 
RXR, ER). Similarly, genes involved in immune system function including TNFRl, 

5 IL-IR, IRAKl/2, TRAF2, TRAF6, TRADD, FADD, IKKe, IKKy, IKKP, IKKa, 
IkBa, IkBp, p50, p65, Rac, RhoA, Cdc42, ROCK, Pakl/2/3/4/5/6, cIAP, HDACl/2, 
CBP, p-TrCP, Rip2/4, and others are also important targets for the siRNAs described 
in this document and may be useful in treating immune system disorders. Genes 
involved in apoptosis, such as Dshl/2,PTEN, PI 10 (pan), P85, PDKl/2, Aktl, Akt2, 

1 0 Akt (pan), p70^^^ GSK3 P, PP2A (cat), p-catenin, HSP90, Cdc37/p50, P23, Bad, 
BclxL, Bcl2, Smac/Diablo, and Askl are potentially useful in the treatment of 
diseases that involve defects in programmed cell death (e.g. cancer), while siRNA 
agents directed against p53, MDM2, Chkl/2, BRCAl/2, ATM, ATR, pis"^'^^ P27, 
P21, Skp2, Cdc25C/A, 14-3-3a/e, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 

1 5 PPARy, RXRa, ERq and related genes may play a critical role in combating diseases 
associated with disraptions in DNA repair, and cell cycle abnormalities. 

Tables VI -Table X below provide examples of useful pools for inhibiting 
different genes in the human insulin pathway and tyrosine kinase pathways, proteins 

20 involved in the cell cycle, the production of nuclear receptors^and other genes. These 
particular pools are particularly useful in humans, but would be useful in any species 
that generates an appropriately homologous mRNA. Further, within each of the listed 
pools any cTnesequence maybe used independently but preferably at least two of the 
listed sequences, more preferably at least three, and most preferably all of the listed 

25 sequences for a given gene is present. 



Table VI 



Gene 
Name 


ACC# 


Gl 


L.L 


Duplex # 


Sequence 


SEQ. 
ID NO. 


AKT1 


NM 005163 


4885060 


207 


D-003000-05 


GACAAGGACGGGCACATTA 


465 


AKT1 


NM 005163 


4885060 


207 


D-003000-06 


GGACAAGGACGGGCACATT 


466 


AKT1 


NM 005163 


4885060 


207 


D-003000-07 


GCTACTTCCTCCTCAAGAA 


467 


AKT1 


NM 005163 


4885060 


207 


D-003000-08 


GACCGCCTCTGCTTTGTCA 


468 


AKT2 














AKT2 


NM 001626 


6715585 


208 


D-003001-05 


GTACTTCGATGATGAATTT 


469 


AKT2 


NM 001626 


6715585 


208 


D-003001-06 


GCAAAGAGGGCATCAGTGA 


470 


AKT2 


NM 001626 


6715585 


208 


D-003001-07 


GGGCTAAAGTGACCATGAA 


471 
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AKT2 


NM 001626 


6715585 


208 


D-003001-08 


GCAGAATGCCAGCTGATGA 


472 


AKT3 














AKT3 


NM 005465 


32307164 


10000 


D-003002-05 


GGAGTAAACTGGCAAGATG 


473 


AKT3 


NM 005465 


32307164 


10000 


D-003002-06 


GACATTAAATTTCCTCGAA 


474 


AKT3 


NM 005465 


32307164 


10000 


D-003002-07 


GACCAAAGCCAAACACATT 


475 


AKT3 


NM 005465 


32307164 


10000 


D-003002-08 


GAGGAGAGAATGAATTGTA 


476 


CBL 














CBL 


NM 005188 


4885116 


867 


D-003003-05 


GGAGACACATTTCGGATTA 


477 


CBL 


NM 005188 


4885116 


867 


D-003003-06 


GATCTGACCTGCAATGATT 


478 


CBL 


NM 005188 


4885116 


867 


D-003003-07 


GACAATCCCTCACAATAAA 


479 


CBL 


NM 005188 


4885116 


867 


D-003003-08 


CCAGAAAGCTTTGGTCATT 


480 


CBLB 














CBLB 


NM 170662 


29366807 


868 


D-003004-05 


GACCATACCTCATAACAAG 


481 


CBLB 


NM 170662 


29366807 


868 


D-0030O4-O6 


TGAAAGACCTCCACCAATC 


482 


CBLB 


NM 170662 


29366807 


868 


D-003004-07 


GATGAAGGCTCCAGGTGTT 


483 


CBLB 


NM 170662 


29366807 


868 


D-003004-08 


TATCAGCATTTACGACTTA 


484 


EIF4EBP1 














EIF4EBP1 


NM_004095 


20070179 


1978 


D-003005-05 


GCAATAGCCCAGAAGATAA 


485 


EIF4EBP1 


NM 004095 


20070179 


1978 


D-003005-06 


CGCAATAGCCCAGAAGATA 


486 


EIF4EBP1 


NM_004095 


20070179 


1978 


D-003005-07 


GAGATGGACATTTAAAGCA 


487 


EIF4EBP1 


NM 004095 


20070179 


1978 


D-003005-08 


CAATAGCCCAGAAGATAAG 


488 


FOX01A 














FOX01A 


NM 002015 


9257221 


2308 


D-003006-05 


CCAGGCATCTCATAACAAA 


489 


FOX01A 


NM 002015 


9257221 


2308 


D-003006-06 


CCAGATGCCTATACAAACA 


490 


FOX01A 


NM 002015 


9257221 


2308 


D-003006-07 


GGAGGTATGAGTCAGTATA 


491 


FOX01A 


NM 002015 


9257221 


2308 


D-003006-08 


GAGGTATGAGTCAGTATAA 


492 


FOX03A 














FOX03A 


NM 001455 


4503738 


2309 


D-003007-01 


CAATAGCAACAAGTATACC 


493 


FOX03A 


NM 001455 


4503738 


2309 


D-003007-02 


TGAAGTCCAGGACGATGAT 


494 


F0X03A 


NM 001455 


4503738 


2309 


D-003007-03 


TGTCACACTATGGTAACCA 


495 


FOX03A 


NM 001455 


4503738 


2309 


D-003007-04 


TGTTCAATGGGAGCTTGGA 


496 


FRAP1 














FRAP1 


NM 004958 


19924298 


2475 


D-003008-05 


GAGAAGAAATGGAAGAAAT 


497 


FRAP1 


NM 004958 


19924298 


2475 


D-003008-06 


CCAAAGTGCTGCAGTACTA 


498 


FRAP1 


NM 004958 


19924298 


2475 


D-003008-07 


GAGCATGCCGTCAATAATA 


499 


FRAP1 


NM_004958 


19924298 


2475 


D-003008-08 


GGTCTGAACTGAATGAAGA 


500 


GSK3A 














GSK3A 


NM 019884 


11995473 


2931 


D-003009-05 


GGACAAAGGTGTTCAAATC 


501 


GSK3A 


NM 019884 


11995473 


2931 


D-003009-06 


GAACCCAGCTGCCTAACAA 


502 


GSK3A 


NM 019884 


11995473 


2931 


D-003009-07 


GCGCACAGCTTCTTTGATG 


503 


GSK3A 


NM 019884 


11995473 


2931 


D-003009-08 


GCTCTAGCCTGCTGGAGTA 


504 


GSK3B 














GSK3B 


NM 002093 


21361339 


2932 


D-003010-05 


GAAGAAAGATGAGGTCTAT 


505 


GSK3B 


NM 002093 


21361339 


2932 


D-003010-06 


GGACCCAAATGTCAAACTA 


506 


GSK3B 


NM 002093 


21361339 


2932 


D-003010-07 


GAAATGAACCCAAACTACA 


507 


GSK3B 


NM 002093 


21361339 


2932 


D-003010-08 


GATGAGGTCTATCTTAATC 


508 


IGF1 














IGF1 


NM 000618 






D-003011-05 


GGAAGTACATTTGAAGAAC 


509 


IGF1 


NM 000618 






D-003011-06 


AGAAGGAAGTACATTTGAA 


510 


IGF1 . 


NM 000618 






D-003011-07 


CCTCAAGCCTGCCAAGTCA 


511 


IGF1 


NM 000618 






D-003011-08 


GGTGGATGCTCTTCAGTTC 


512 


IGF1R 














IGF1R 


NM 000875 


11068002 


3480 


D-003012-05 


CAACGAAGCTTCTGTGATG 


513 


IGF1R 


NM 000875 


11068002 


3480 


D-003012-06 


GGCCAGAAATGGAGAATAA 


514 


IGF1R 


NM 000875 


11068002 


3480 


D-003012-07 


GAAGCACCCTTTAAGAATG 


515 


IGF1R 


NM 000875 


11068002 


3480 


D-003012-08 


GCAGACACCTACAACATCA 


516 
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INPP5D 














INPP5D 


NM 005541 


5031798 


3635 


D-003013-05 


GGAATTGCGTTTACACTTA 


517 


INPP5D 


NM 005541 


5031798 


3635 


D-003013-06 


GGAAACTGATCATTAAGAA 


518 


INPP5D 


NM 005541 


5031798 


3635 


D-003013-07 


CGACAGGGATGAAGTACAA 


519 


INPP5D 


NM 005541 


5031798 


3635 


D-003013-08 


AAACGCAGGTGCCCATCTA 


520 


INSR 














INSR 


NM 000208 


4557883 


3643 


D-003014-05 


GGAAGACGTTTGAGGATTA 


521 


INSR 


NM 000208 


4557883 


3643 


D-003014-06 


GAACAAGGGTCCCGAGAGT 


522 


INSR 


NM 000208 


4557883 


3643 


D-003014-07 


GGAGAGACCTTGGAAATTG 


523 


INSR 


NM 000208 


4557883 


3643 


D-003014-08 


GGACGGAACCCACCTATTT 


524 


IRS1 














IRS1 


NM_005544 


5031804 


3667 


D-003015-05 


AAAGAGGTCTGGCAAGTGA 


525 


IRS1 


NM 005544 


5031804 


3667 


D-003015-06 


GAACCTGATTGGTATCTAC 


526 


IRS1 


NM 005544 


5031804 


3667 


D-003015-07 


GCACGGCGATGTAGTGCTT 


527 


IRS1 


NM 005544 


5031804 


3667 


D-003015-08 


GTCAGTCTGTCGTCCAGTA 


528 


MLLT7 














MLLT7 


NM_005938 


5174578 


4303 


D-003016-05 


GGACTGGACTTCAACTTTG 


529 


MLLT7 


NM 005938 


5174578 


4303 


D-003016-06 


CCACGAAGCAGTTCAAATG 


530 


MLLT7 


NM 005938 


5174578 


4303 


D-003016-07 


GAGAAGCGACTGACACTTG 


531 


MLLT7 


NM 005938 


5174578 


4303 


D-003016-08 


GACCAGAGATCGOTAACCA 


532 


PDPK1 














PDPK1 


NM 002613 


4505694 


5170 


D-003017-05 


CAAGAGACCTCGTGGAGAA 


533 


PDPK1 


NM 002613 


4505694 


5170 


D-003017-06 


GACCAGAGGCCAAGAATTT 


534 


PDPK1 


NM 002613 


4505694 


5170 


D-003017-07 


GGAAACGAGTATCTTATAT 


535 


PDPK1 


NM 002613 


4505694 


5170 


D-003017-08 


GAGAAGGGACATATCATAA 


536 


PIK3CA 














PIK3CA 


NM 006218 


5453891 


5290 


D-003018-05 


GCTATCATCTGAACAATTA 


537 


PIK3CA 


NM 006218 


5453891 


5290 


D-003018-06 


GGATAGAGGCCAAATAATA 


538 


PIK3CA 


NM 006218 


5453891 


5290 


D-003018-07 


GGACAACTGTTTCATATAG 


539 


PIK3CA 


NM 006218 


5453891 


5290 


D-003018-08 


GCCAGTACCTCATGGATTA 


540 


PIK3CB 














PIK3CB 


NM 006219 


5453893 


5291 


D-003019-05 


CGAGAAGAGTGCGGAGAGA 


541 


PIK3CB 


NM 006219 


5453893 


5291 


D-003019-06 


TCAAGTGTGTGCTAATATG 


542 


PIK3CB 


NM 006219 


5453893 


5291 


D-003019-07 


GGATTCAGTTGGAGTGATT 


543 


PIK3CB 


NM 006219 


5453893 


5291 


D-003019-08 


TTTCAAGTGTCTCCTAATA 


544 


PIK3R1 














PIK3R1 


NM 181504 


32455251 


5295 


D-003020-05 


GGAAATATGGGTTGTGTGA 


545 


PIK3R1 


NM ,181.504..,. 


32455251 


5295 


D-003020-06 


GAAAGAGGAGAGACCAATA 


546 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-07 


GTAAAGCATTGTGTCATAA 


547 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-08 


GGATCAAGTTGTCAAAGAA 


548 


PIK3R2 














PIK3R2 


NM 005027 


4826907 


5296 


D-003021-05 


GGAAAGGCGGGAACAATAA 


549 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-06 


GATGAAGCGTACTGCAATT 


550 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-07 


GGAGAGGGAATCTGACTAC 


551 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-O8 


GCAAGATCCGAGACCAGTA 


552 


PPP2R2B 














PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-05 


GAATGCAGGTTACTTTCTT 


553 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-06 


GACGGAAGGTGAGATTATC 


554 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-07 


TCGATTACGTGAAGAGTTT 


555 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-08 


CCTGAAGAGTTTAGAAATA 


556 


FT EN 














PTEN 


NM 000314 


4506248 


5728 


D-003023-05 


GTGAAGATCTTGACCAATG 


557 


PTEN 


NM 000314 


4506248 


5728 


D-003023-06 


GATGAGCATACACAAATTA 


558 


PTEN 


NM 000314 


4506248 


5728 


D-003023-07 


GGCGCTATGTGTATTATTA 


559 


PTEN 


NM 000314 


4506248 


5728 


D-003023-08 


GTATAGAGCGTGCAGATAA 


560 


RPS6 
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RPS6 


NM 001010 


17158043 


6194 


D-003024-05 


GCCAGAAACTCATTGAAGT 


561 


RPS6 


NM 001010 


17158043 


6194 


D-003024-06 


GGATATTCCTGGACTGACT 


562 


RPS6 


NM 001010 


17158043 


6194 


D-003024-07 


CCAAGGAGAACTGGAGAAA 


563 


RPS6 


NM 001010 


17158043 


6194 


D-003024-08 


GCGTATGGCGACAGAAGTT 


564 


RPS6KA1 














RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-05 


GATGACAGCTTCTACTTTG 


565 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-06 


GAGAATGGGCTCCTCATGA 


566 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-07 


CAAGCGGGATCCTTCAGAA 


567 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-08 


CCACCGGCCTGATGGAAGA 


568 


RPS6KA3 














RPS6KA3 


NM_004586 


4759049 


6197 


D-003026-05 


GAAGGGAAGTTGTATCTTA 


569 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-06 


GAAAGTATGTGTATGTAGT 


570 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-07 


GGACAGCATCCAAACATTA 


571 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-08 


GGAGGTGAATTGCTGGATA 


572 


SGK 














SGK 


NM 005627 


5032090 


6446 


D-003027-01 


TTAATGGTGGAGAGTTGTT 


573 


SGK 


NM 005627 


5032090 


6446 


D-003027-04 


ATTAACTGGGATGATCTCA 


574 


SGK 


NM 005627 


25168262 


6446 


D-003027-05 


GAAGAAAGGAATCCTGAAA 


575 


SGK 


NM 005627 


25168262 


6446 


D-003027-06 


AAACACAGCTGAAATGTAC 


576 


TSC1 














TSC1 


NM 000368 


24475626 


7248 


D-003028-05 


GAAGATGGCTATTCTGTGT 


577 


TSC1 


NM 000368 


24475626 


7248 


D-003028-06 


TATGAAGGCTCGAGAGTTA 


578 


TSC1 


NM 000368 


24475626 


7248 


D-003028-07 


CGACACGGCTGATAACTGA 


579 


TSC1 


NM 000368 


24475626 


7248 


D-003028-08 


CGGCTGATGTTGTTAAATA 


580 


TSC2 














TSC2 


NM 000548 


10938006 


7249 


D-003029-05 


GCATTAATCTCTTACCATA 


581 


TSC2 


NM 000548 


10938006 


7249 


D-003029-06 


CGAATGTCCTCTTGTCTTT 


582 


TSC2 


NM 000548 


10938006 


7249 


D-003029-07 


GGAGACACATCACCTACTT 


583 


TSC2 


NM 000548 


10938006 


7249 


D-003029-08 


TCAGCAGGCTCATCAAGAA 


584 


XP01 














XP01 


NM 003400 


8051634 


7514 


D-003030-05 


GAAAGTCTCTGTCAAAATA 


585 


XP01 


NM 003400 


8051634 


7514 


D-003030-06 


GCAATAGGCTCCATTAGTG 


586 


XP01 


NM 003400 


8051634 


7514 


D-003030-07 


GGAACATGATCMCTTATA 


587 


XP01 


NM 003400 


8051634 


7514 


D-003030-08 


GGATACAGATTGCATAAAT 


588 



Table VM 



Gene 
Name 


Acc# 


Gl 


LL 


Duplex # 


Sequence 


SEQ. 
ID NO. 


ABL1 














ABL1 


NM 007313 


6382057 


25 


D-0031 00-05 


GGAAATCAGTGACATAGTG 


589 


ABL1 


NM_007313 


6382057 


25 


D-0031 00-06 


GGTCCACACTGCAATGTTT 


590 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-07 


GAAGGAAATCAGTGACATA 


591 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-08 


TCACTGAGTTCATGACCTA 


592 


ABL2 














ABL2 


NM 007314 


6382061 


27 


D-0031 01 -05 


GAAATGGAGCGAACAGATA 


593 


ABL2 


NM 007314 


6382061 


27 


D-003101-06 


GAGCCAAATTTCCTATTAA 


594 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -07 


GTAATAAGCCTACAGTCTA 


595 


ABL2 


NM_007314 


6382061 


27 


D-0031 01-08 


GGAGTGAAGTTCGCTCTAA 


596 


ACK1 














ACK1 


NM 005781 


8922074 


10188 


D-0031 02-05 


AAACGCAAGTCGTGGATGA 


597 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-06 


GCAAGTCGTGGATGAGTAA 


598 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-07 


GAGCACTACCTCAGAATGA 


599 



106 





NM 005781 


8922074 


10188 


D-0031 02-08 


TCAGCAGCACCCACTATTA 


600 
















AI 1/' 

ALIV 


NM 004304 


29029631 


238 


D-0031 03-05 


GACAAGATCCTGCAGAATA 


601 


AI 1/" 

ALK 


NM 004304 


29029631 


238 


D-0031 03-06 


GGAAGAGTCTGGCAGTTGA 


602 


AI i/^ 
ALK 


NM_ 004304 


29029631 


238 


D-0031 03-07 


GCACGTGGCTGGGGACATT 


603 


AI 1/ 

ALK 


NM 004304 


29029631 


238 


D-0031 03-08 


GAACTGCAGTGAAGGAAGA 


604 


AVI 

AXL 














AVI 

AXL 


NM 021913 


21536465 


558 


D-003104-05 


GGTCAGAGCTGGAGGATTT 


605 


AVI 

AaL 


NM 021913 


21536465 


558 


D-0031 04-06 


GAAAGAAGGAGACCCGTTA 


606 


AVI 

AXL 


NM 021913 


21536465 


558 


D-0031 04-07 


CCAAGAAGATCTACAATGG 


607 


AVI 

AXL 


NM 021913 


21536465 


558 


D-0031 04-08 


GGAACTGCATGCTGAATGA 


608 


Dl IX 

BLK 














BLK 


NM 001715 


4502412 


640 


D-0031 05-05 


GAGGATGCGTGCTGGATTT 


609 


BLK 


NM 001715 


4502412 


640 


D-0031 05-06 


ACATGAAGGTGGGCATTAA 


610 


BLK 


NM 001715, 


4502412 


640 


D-0031 05-07 


GGTGAGCGCCCAAGACAAG 


611 


BLK 


NM 001715 


4502412 


640 


D-0031 05-08 


GAAACTCGGGTCTGGACAA 


612 


BMX 














BMX 


NM 001721 


21359831 


660 


D-0031 06-05 


AAACAAACCTTTCCTACTA 


613 


BMX 


NM 001721 


21359831 


660 


D-0031 06-06 .. 


GAAGGAGCATTTATGGTTA 


614 


BMX 


NM 001721 


21359831 


660 


D-0031 06-07 


GAGAAGAGATTACCTTGTT 


615 


BMX 


NM 001721 


21359831 


660 


D-003106-08 


GTAAGGCTGTGAATGATAA 


616 


BTK 














BTK 


NM 000061 


4557376 


695 


D-0031 07-05 


GAACAGGAATGGAAGCTTA 


617 


BTK 


NM 000061 


4557376 


695 


D-0031 07-06 


GCTATGGGCTGCCAAATTT 


618 


BTK 


NM 000061 


4557376 


695 


D-0031 07-07 


GAAAGCAACTTACCATGGT 


619 


BTK 


NM 000061 


4557376 


695 


D-0031 07-08 


GGTAAACGATCAAGGAGTT 


620 


C20orf64 














C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-05 


CAACTTAGCCAAGACAATT 


621 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-06 


GAAA7TGAAGGCTGAGTGA 


622 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-07 


TGGAAGAGCTGAACATTGT 


623 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-08 


GCTTGCAACTGGTTATATA 


624 


CSF1R 














CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-05 


GGAGAGCTCTGAG6TTTGA 


625 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-06 


CAAGAACGCTACGrrCCAA 


626 


CSF1R 


NM 005211 


27262658 


1436 


D-003109-07 


CCACGCAGCTGCCTTAGAA 


627 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-08 


GGAACAACCTGCAGTTTGG 


628 


CSK 














CSK 


NM 004383 


.4758077 


1445 


D-0031 10-05 


CAGAATGTATTGGCAAGTA 


629 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-06 


GAACAAAGTCGCC6TCAAG 


630 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-07 


GCGAGTGCCTTATCCAAGA 


631 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-08 


GGAGAAGGGCTACAAGATG 


632 


DDR1 














DDR1 


NM 013994 


7669484 


780 


D-0031 11 -05 


GG AGATGGAGTTTGAGTTT 


633 


DDR1 


NM 013994 


7669484 


780 


D-0031 11 -06 


CAGAGGCCCTGTCATCTTT 


634 


DDR1 


NM 013994 


7669484 


780 


D-0031 11 -07 


GCTGGTAGCTGTCAAGATC 


635 


DDRl 


NM 013994 


7669484 


780 


D-0031 11 -08 


TGAAAGAGGTGAAGATCAT 


636 


DDR2 
















INIVI UUO 1 


5453813 


4921 


D-0031 12-05 


GGTAAGAACTACACAATCA 


637~~ 


DDR2 


NM 006182 


5453813 


4921 


D-003112-06 


GAACGAGAGTGCCACCAAT 


638 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-07 


ACACCAATGTGAAGTTTAT 


6')Q 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-08 


CAAGAAGAATGCCAGGAAT 


640 


DKFZp761 
P1010 














DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13-05 


CCTAGAAGCTGCCATTAAA 


641 
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DKF2p761 
P1010 


NM_018423 


8922178 


55359 


D-003113-06 




RAO 


DKFZp761 
P1010 


IMM 018423 


80221 7ft 




U-UUo 1 1 0-U f 


UCOAlsTAeCTGCACACATA 


643 


nKF7n7fi1 
P1010 






oo3o9 


D-0031 1 3-08 


GGTGGTACCTGAACTGTAT 


644 


















NM 005228 


4885198 


1956 


D-0031 14-05 


GAAGGAAACTGAATTCAAA 


645 


tGFR 


NM 005228 


4885198 


1956 


D-0031 14-06 


GGAAATATGTACTACGAAA 


646 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-07 


CCACAAAGCAGT6AA7TTA 


647 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-08 


GTAACAA6CTCACGCAGTT 


648 


tPHAI 














EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-05 


GACCAGAGCTTCACCATTC 


649 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-06 


GCAAGACTGTGGCCATTAA 


650 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-07 


GGGCGAAGCTGACCTATGA 


651 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-08 


GATTGTAGCCGTCATCTTT 


652 


EPHA2 














EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-05 


GGAGGGATGTGGCAACTTG 


653 


EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-06- - 


GCAGCAAGGTGCACGAATT 


654 


EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-07 


GGAGAAGGATGGGGAGTTC 


655 


EPHA2 


NM 004431 




1969 


D-003116-08 


GAAGTTCACTACCGAGATC 


656 


EPHA3 














EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-05 


GATCGGACCTCCAGAAATA 


657 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-06 


GAACTCAGCTCAGAAGATT 


658 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-07 


GCAAGAGGCACAAATGTTA 


659 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-08 


GAGCATCAGTTTACAAAGA 


660 


EPHA4 














EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-05 


GGTCTGGGATGAAGTATTT 


661 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-06 


GAATGAAGTrAGCTTATTG 


662 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-07 


GAACTTGGGTGGATAGCAA 


663 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-08 


GAGATTAAATTCACCTTGA 


664 


EPHA7 














EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-05 


GAAAAGAGATGTTGCAGTA 


665 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-06 


CTAGATGCCTCCTGTATTA 


666 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-07 


AGAAGAAGGTTATCGTTTA 


667 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-08 


TAGCAAAGCTGACCAAGAA 


668 


EPHA8 














EPHA8 


NM 020526 


4820*903^ 


- 2046 


D-0031 20-05 


GAAGATGCACTATCAGAAT 


669 


EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-06 


GAGAAGATGCACTATCAGA 


670 


EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-07 


AACCTGATCTCCAGTGTGA 


671 


EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-08 


TCTCAGACGTGGGCTATGT 


672 


EPHB1 














EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -05 


GCGATAAGCTCCAGCATTA 


673 


EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -06 


GAAACGGGCTTATAGCAAA 


674 


EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -07 


GGATGAAGATCTACATTGA 


675 


EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -08 


GCACGTCTCTGTCAACATC 


676 


p* ni too 

EPHB2 














EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-05 


actatgagctgcagtacta 


677 


EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-06 


GTACAACGCCACAGCGATA 


678 


bPHBZ 


NM 017449 


17975764 


2048 


D-0031 22-07 


GGAAAGCAATGACTGTTCT 


679 


EPHB2 


INIVI \j 1 f *tHo 






U-UUolZZ-Oo 


CeGACAAGCTGCAACACTA 


680 


EPHB3 














EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-05 


ggtgtgatctccaatgtga 


681 


EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-06 


GGGATGACCTCCTGTACAA 


682 


EPHB3 


NM 004443 


17975767 


2049 


D-003123-07 


CAGAAGACCTGCTCCGTAT 


683 


EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-08 


GAGATGAAGTACTTTGAGA 


684 
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EPHB4 














EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-05 


GGACAAACACGGACAGTAT 


685 


EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-06 


GTACTAAGGTCTACATCGA 


686 


EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-07 


GGAGAGAAGCAGAATATTC 


687 


EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-08 


GCCAATAGCCACTCTAACA 


688 


EPHB6 














EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-05 


GGAAGTCGATCCTGCTTAT 


689 


EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-06 


GGACGAAGGTGGACACAAT 


690 


EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-07 


TGTGGGAAGTGATGAGTTA 


691 


EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-08 


CGGGAGACGTTCACCCTTT 


692 


ERBB2 














ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-05 


GGACGAATTCTGCACAATG 


693 


ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-06 


GACGAATTGTGCACAATGG 


694 


ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-07 


CTACAACACAGACACGTTT 


695 


ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-08 


AGACGAA6CATACGTGATG 


696 


ERBB3 














ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-05 


AAGAGGATGTCAACGGTTA 


697 


ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-06 


GAAGACTGCCAGACATTGA 


698 


ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-07 


GACAAACACTGGTGCTGAT 


699 


ERBB3 


NM 001982 


4503596 


2065 


D-0031 27r08 


GCAGTGGATTCGAGAAGTG 


700 


ERBB4 














ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-05 


GAGGAAAGATGCCAATTAA 


701 


ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-06 


GCAGGAAACATCTATATTA 


702 


ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-07 


GATCACAACTGCTGCTTAA 


703 


ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-08 


CCTCAAAGATACCTAGTTA 


704 


FER 














PER 


NM 005246 


4885230 


2241 


D-0031 29-05 


GGAGTGACCTGAAGAATTC 


705 


FER 


NM 005246 


4885230 


2241 


D-0031 29-06 


TAAAGCAGATTCCCATTAA 


706 


FER 


NM 005246 


4885230 


2241 


D-0031 29-07 


GGAAAGTACTGTCCAAATG 


707 


FER 


NM 005246 


4885230 


2241 


D-0031 29-08 


GAACAACGGCTGCTAAAGA 


708 


FES 














FES 


NM 002005 


13376997 


2242 


D-0031 30-05 


CGAGGATCGTGAAGCAGTA 


709 


FES 


NM 002005 


13376997 


2242 


D-0031 30-06 


AGGAATACCTGGAGATTAG-^ 


710 


FES 


NM 002005 


13376997 


2242 


D-0031 30-07 


CAACAGGAGCTCCGGAATG 


711 


FES 


NM 002005 


13376997 


2242 


D-0031 30-08 


GGTGTTGGGTGAGCAGATT 


712 


FGFR1 














FGFR1 


NM 000604 


13186232 


2260 


D-0031 31 -05 


TAAGAAATGTCTCCTTTGA 


713 


FGFR1 


NM 000604 


13186232. 


2260 


D-0031 31-06 


GAAGACTGCTGGAGTTAAT 


714 


FGFR1 


NM 000604 


13186232 


2260 


D-0031 31 -07 


GATGGTCCCTTGTATGTCA 


715 


FGFR1 


NM 000604 


13186232 


2260 


D-0031 31 -08 


CTTAAGAAAT6TCTCCTTT 


716 


FGFR2 














FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-05 


CCAAATCTCTCAACCAGAA 


717 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-06 


GAACAGTATTCACCTAGTT 


718 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-07 


G6CCAACACTGTGAAGTTT 


719 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-08 


GTGAAGATGTTGAAAGATG 


720 


FGFR3 














FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-05 


TGTGGGACCTGGTGTCTGA 


721 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-06 


GCATGAAGCTGCGGCATCA 


722 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-07 


6GACGGCACAGCCTACGTT 


723 


FGFR3 


NM 000142 


.13112046 


2261 


D-0031 33-08 


TGCACAACCTCGACTACTA 


724 


rGpR4 














FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-05 


6CACTGGAGTCTCGTGATG 


725 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-06 


CATAGGGACCTCTCGAATA 


726 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-07 


ATACGGACATCATCCTGTA 


727 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-08 


ATAGGGACCTCTCGAATAG 


728 


FGR 
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FGR 


NM 005248 


4885234 


2268 


D-0031 35-05 


GCGATCATGTGAAGCATTA 


729 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-06 


TCACTGAGCTCATCACCAA 


730 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-07 


GAAGAGTGGTACTTTGGAA 


731 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-08 


CCCAGAAGCTGCCCTCTTT 


732 


FLT1 














FLT1 


NM 002019 


4503748 


2321 


D-0031 36-05 


GAGCAAACGTGACTTATTT 


733 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-06 


CCAAATGGGTTTCATGTTA 


734 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-07 


CAACAAGGATGCAGCACTA 


735 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-08 


GGACGTAACTGAAGAGGAT 


736 


FLT3 














FLT3 


NM 004119 


4758395 


2322 


D-0031 37-05 


GAAGGGATCTACACCATTA 


737 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-06 


GAAGGAGTCTGGAATAGAA 


738 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-07 


GAATTTAAGTCGTGTGTTC 


739 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-08 


GGAATTCATTTCACTCTGA 


740 


FLT4 














FLT4 


NM 002020 


4503752 


2324 


D-0031 38-05 


GCAAGAACGTGCATCTGTT 


741 


FLT4 


NM 002020 


4503752 


2324 


D-0031 38-06 


GCGAATACCTGTCCTACGA 


742 


FLT4 


NM 002020 


4503752 


2324 


D-0031 38-07 


GAAGACATTTGAGGAATTC 


743 


FLT4 


NM 002020 


4503752 


2324 


D;^031 38-08 


GAGCAGCCATTCATCAACA 


744 


FRK 














FRK 


NM 002031 


4503786 


2444 


D-0031 39-05 


GAAACAGACTCTTCATATT 


745 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-06 


GAACAATACCACTCCAGTA 


746 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-07 


CAAGACCGGTTCCTTTCTA 


747 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-08 


GCAAGAATATCTCCAAAAT 


748 


FYN 












- ». 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-05 


GGAATGGACTCATATGCAA 


749 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-06 


GCAGAAGAGTGGTACrrTG 


750 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-07 


CAAAGGAAGTTTACTGGAT 


751 


FYN 


NM 002037 


23510344 


2534 


D-003140-08 


GAAGAGTGGTACTTTGGAA 


752 


HCK 














HCK 


NM 002110 


4504356 


3055 


D-003141-05 


GAGATACCGTGAAACATTA 


753 


HCK 


NM 002110 


4504356 


3055 


D-003141-06 


GCAGGGAGATACCGTGAAA 


754 


HCK 


NM 002110 


4504356 


3055 


D-0031 41 -07 


CATCGTGGTTGCCCTGTAT 


i755 


HCK 


NM 002110 


4504356 


3055 


D-0031 41 -08 


TGTGTAAGATTGCTGACTT 


756 


ITK 














ITK 


NM 005546 


21614549 


3702 


D-0031 44-05 


CAAATAATCTGGAAACCTA 


757 


ITK 


NM 005546 


21614549 


3702 


D-0031 44-06 


GAAGAAACGAGGAATAATA 


758 


ITK 


NM 005546 


21614549. 


3702^ 


D-0031 44-07 


GAAACTCTCTCATCCCAAA 


759 


ITK 


NM 005546 


21614549 


3702 


D.003144-08 


GGAATGGGCATGAAGGATA 


760 


JAK1 














JAK1 


NM 002227 


4504802 


3716 


D-0031 45-05 


CCACATAGCTGATCTGAAA 


761 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-06 


TGAAATGACTCACATTGTA 


762 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-07 


TAAGGAACCTCTATCATGA 


763 


JAK1 


NM 002227 


4504802 


3716 


D-003145-08 


GCAGGTGGCTGTTAAATCT 


764 


JAK2 














JAK2 


NM 004972 


13325062 


3717 


D-003146-05 


GCAAATAGATCCAGTTCTT 


765 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-06 


GAGCAAAGATCCAAGACTA 


766 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-07 


GCCAGAAACTTGAAACTTA 


767 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-08 


GTACAGATTTCGCAGATTT 


768 


JAK3 














JAK3 


NM 000215 


4557680 


3718 


D-0031 47-05 


GCGCCTATCTTTCTCCTTT 


769 


JAK3 


NM 000215 


4557680 


3718 


D-003147-06 


CCAGAAATCGTAGACATTA 


770 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-07 


CCTCATCTCTTCAGACTAT 


771 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-08 


TGTACGAGCTCTTCACCTA 


772 


KDR 














KDR 


NM 002253 


11321596 


3791 


D-0031 48-05 


GGAAATCTCTTGCAAGCTA 


773 
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NM 002253 


11321596 


3791 


D-0031 48-06 


GATTACAGATCTCCATTTA 


774 




NM 002253 


11321596 


3791 


D-0031 48-07 


GCAGACAGATCTAGGTTTG 


775 




NM 002253 


11321596 


3791 


D-0031 48-08 


GCGATGGCCTCTTCTGTAA 


776 
















l/IA AH A7ft 


NM 014916 


7662475 


22853 


D-0031 49-05 


GAAATTCTCTCAACTGATG 


777 




NM 014916 


7662475 


22853 


D-0031 49-06 


GCAGAGGTCTTGACACTTT 


778 


1^1 A A-l A7fi 


NM 014916 


7662475 


22853 


D-0031 49-07 


TAAATGATCTTCAGACAGA 


779 


KiAAlU79 


NM 014916 


7662475 


22853 


D-0031 49-08 


GAGCAGCCCTACTCTGATA 


780 
















l/'IT 


NM 000222 


4557694 


3815 


D-0031 50-05 


AAACACGGCTTAAGCAATT 


781 


KIT 


Ik Ift A MAAAAA 

NM 000222 


4557694 


3815 


D-0031 50-06 


GAACAGAACCTTCACTGAT 


782 




NM 000222 


4557694 


3815 


D-0031 50-07 


GGGAAGCCCTCATGTCTGA 


783 


KIT 


NM_000222 


4557694 


3815 


D-0031 50-08 


GCAATTCCAITTATGTGTT 


784 


LCK 














LOK 


NM 005356 


20428651 


3932 


D-0031 51 -05 


GAACTGGCATTATCCCATA 


785 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -06 


GAGAGGTGGTGAAACATTA 


786 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -07 


GGGCGAAGTTTGCCATTAA 


787 


LCK 


NM 005356 


20428651 


3932 


D-0031 51-08 


GCACGCTGCTCATCCGAAA 


788 


1 Tlx 

LTK 















LTK 


NM 002344 


4505044 


4058 


D-0031 52-05 


TGAATTCACTCCTGCCAAT 


789 


1 Tlx 

LTK 


NM 002344 


4505044 


4058 


D-0031 52-06 


GTGGCAACCTCAAGACTGA 


790 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-07 


GGAGGTAGCTGTGGATAAC 


791 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-08 


GCAAGTTTCGCCATCAGAA 


792 


LYN 














LYN 


NM 002350 


4505054 


4067 


D-0031 53-05 


GCAGATGGCTTGTGCAGAA 


793 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-06 


GGAGAAGGCTTGTATTAGT 


794 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-07 


GATGAGCTCTATGACATTA 


795 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-08 


GGTGCTAAGTTCCCTATTA 


796 


IV^ATK 














MATK 


NM 002378 


21450841 


4145 


D-0031 54-05 


TGAAGAATATCAAGTGTGA 


797 


tiA ATI/ 

MATK 


NM 002378 


21450841 


4145 


D-0031 54-06 


CCGCTCAGCTCCTGCAGTT 


798 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-07 


TACTGAACCTGCAGCATTT 


799 


A ATIX 

MATK 


NM 002378 


21450841 


4145 


D-0031 54-08 


TGGGAGGTCTTCTCATATG 


80® 


MERTK 














MERTK 


NM 006343 


5453737 


10461 


D-0031 55-05 


GAACTTACCTTAGATAGCT 


801 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-06 


GGACCTGCATACTTACTTA 


802 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-07 


TGACAGGAATCTTCTAATT 


803 


MERTK 


NM 006343 


5453737 


10461- 


J3d3.031 55-08 


GGTAATGGCTCAGTCATGA 


804 


MET 














MET 


NM 000245 


4557746 


4233 


D-0031 56-05 


GAAAGAACCTCTCAACATT 


805 


MET 


NM 000245 


4557746 


4233 


D-0031 56-06 


GGACAAGGCTGACCATATG 


806 


MET 


NM 000245 


4557746 


4233 


D-0031 56-07 


CCAATGACCTGCTGAAATT 


807 


MET 


NM 000245 


4557746 


4233 


D-0031 56-08 


GAGCATACATTAAACCAAA 


808 


MoTIR 














MbTIR 


NM 002447 


4505264 


4486 


D-0031 57-05 


GGATGGAGCTGCTGGCTTT 


809 


Mo I 1R 


NM 002447 


4505264 


4486 


D-0031 57-06 


CTGCAGACCTATAGATTTA 


810 


Mol IK 


NM 002447 


4505264 


4486 


D-0031 57-07 


GCACCTGTCTCACTCTTGA 


811 


iviol 1R 


NM 002447 


4505264 


4486 


D-0031 57-08 


GAAAGAGTCCATCCAGCTA 


812 


iviUoK 














iiAi lot/ 


NM 005592 


5031926 


4593 


D-0031 58-05 


GAAGAAGCCTCGGCAGATA 


813 


MUSK 


NM 005592 


\J\J\J 1 \J £.\J 




U-UUO 1 OO-UD 


/— *T~ A ATA A*F/^T/-»/"» AT/"* AT^T 

t> 1 AA 1 AAT OTCCATCATGT 


814 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-07 


GGAATGAACTGAAAGTAGT 


815 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-08 


GAGATTTCCTGGACTAGAA 


816 


NTRK1 














NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-05 


GGACAACCCTTTCGAGTTC 


817 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-06 


CCAGTGACCTCAACAGGAA 


818 



111 



NIRK1 


NM 002529 


4585711 


4914 


D-0031 59-07 


CCACAATACTTCAGTGATG 


819 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-08 


GAAGAGTGGTCTCCGTTTC 


820 


NTRK2 














NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-05 


GAACAGAAGTAATGAAATG 


821 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-06 


GTAATGCTGTTTCTGCTTA 


822 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-07 


GCAAGACAGTCCAAGTTTG 


823 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-08 


GAAAGTCTATCACATTATC 


824 


NTRK3 














NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -05 


GAGCGAATCTGCTAGTGAA 


825 


NTRK3 


NM_002530 


4505474 


4916 


D-0031 61 -06 


GAAGTTCACTACAGAGAGT 


826 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -07 


GGTCGACGGTCCAAATTTG 


827 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -08 


GAATATCACTTCCATACAC 


828 


PDGFRA 














PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-05 


GAAACTTCCTGGACTATrr 


829 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-06 


GAGATTTGGTCAACTATrT 


830 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-07 


GCACGGCGCTTCCTGATAT 


831 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-08 


CATCAGAGCTGGATCTAGA 


832 


PDGFRB 














PDGFRB 


NM 002609 


15451788 


5,159 


D-0031 63-05 


GAAAGGAGACGTCAAATAT 


833 


PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-06 


GGAATGAGGTGGTCAACTT 


834 


PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-07 


CAACGAGTCTCCAGTGCTA 


835 


PDGFRB 


NM 002609 


16451788 


5159 


D-0031 63-08 


GAGAGGAGGTGCCGAGCAA 


836 


PTK2 














PTK2 


NM 005607 


27886592 


5747 


D-0031 64-05 


GAAGTTGGGTTGTCTAGAA 


837 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-06 


GAAGAACAATGATGTAATC 


838 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-07 


GGAAATTGCTTTGAAGTTG 


839 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-08 


GGTTCAAGCTGGATTATTT 


840 


PTK2B 














PTK2B 


NM 004103 


27886583 


2185 


D-0031 65^5 


GAACATGGCTGACCTCATA 


841 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-06 


GGACCACGCTGCTCTATTT 


842 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-07 


GGACGAGGACTATTACAAA 


843 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-08 


TGGCAGAGCTCATCAACAA 


844 


PTK6'5^ 














PTK6 


NM 005975 


27886594 


5753 


D-0031 66-05 


GAGAAAGTCCTGCCCGTTT 


845 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-06 


TGAAGAAGCTGCGGCACAA 


846 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-07 


CCGCGACTCTGATGAGAAA 


847 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-08 


TGCCCGAGCTTGTGAACTA 


848 


PTK7 














PTK7 


NM 002821 


27886610 


5754 


D-Q031 67-05 


GAGAGAAGCGGACTATTAA 


849 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-06 


CGAGAGAAGOCCACTATTA 


850 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-07 


GGAGGGAGTTGGAGATGTT 


851 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-08 


GAAGACATGCCGCTATTTG 


852 


PTK9 














PTK9 


NM 002822 


4506274 


5756 


D-0031 68-05 


GAAGAACTACGAGAGATTA 


853 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-09 


GAAGGAGACTATTTAGAGT 


854 


PTK9 


NM 002822 


4506274 


5756 


D-003168-10 


GAGCGGATGCTGTATTGTA 


855 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-11 


CTGCAGACTTCGTTTATGA 


856 


PTK9L 














PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-05 


AGAGAGAGGTCCAGCAGAT 


857 


PTK9L 


NM 007284 


31543446 


1.1344 


D-0031 69-06 


TTAACGAGGTGAAGACAGA 


858 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-07 


ACACAGAGCCCAGGGATGT 


859 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-08 


GCTGGGATCAGGACTATGA 


860 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


GCAAAGACCTGGAGAAGAT 


861 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GCACACGGCTGCATGAGAA 


862 


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


GAACTGGCCTGGAGAGAGT 


863 



112 



RET 


NM 000323 


21536316 


5979 


D-0031 70-08 


TTAAATGGATGGCAATTGA 


864 


ROR1 














ROR1 


NM 005012 


4826867 


4919 


D-0031 71-05 


GCAAGCATCTTTACTAGGA 


865 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71 -06 


GAGCAAGGGTAAAGAGCTA 


866 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71 -07 


GAGAGCAACTTCATGTAAA 


867 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71-08 


GAGAATGTCCTGTGTCAAA 


868 


R0R2 














R0R2 


NM 004560 


19743897 


4920 


D-0031 72-05 


GGAACTGGCTGCTGCCTAT 


869 


R0R2 


NM 004560 


19743897 


4920 


D-0031 72-06 


GCAGGTGCCTCCTCAGATG 


870 


R0R2 


NM 004560 


19743897 


4920 


D-0031 72-07 


GCAATGTGCTAGTGTACGA 


871 


R0R2 


NM 004560 


19743897 


4920 


D-0031 72-08 


GAAGACAGAATATGGTTCA 


872 


ROS1 














R0S1 


NM 002944 


19924164 


6098 


D-0031 73-05 


GAGGAGACCTTCTTACTTA 


873 


R0S1 


NM 002944 


19924164 


6098 


D-0031 73-06 


TTACAGAGGTTCAGGATTA 


874 


R0S1 


NM 002944 


19924164 


6098 


D-0031 73-07 


GAACAAACCTAAGCATGAA 


875 


R0S1 


NM 002944 


19924164 


6098 


D-0031 73-08 


GAAAGAGCACTTCAAATAA 


876 


RYK 














RYK 


NM 002958 


11863158 


6259 


D-0031 74-05 


GAAAGATGGTTACCGAATA 


877 


RYK 


NM 002958 


11863158 


6259 


D-0031 74-06 


CAAAGTAGATTCTGAAGTT 


878 


RYK 


NM 002958 


11863158 


6259 


D-0031 74-07 


TCACTACGGTCTATGCTTT 


879 


RYK 


NM_002958 


11863158 


6259 


D-0031 74-08 


6GTGAAGGATATA6CAATA 


880 


SRC 














SRC 


NM 005417 


21361210 


6714 


D-0031 75-05 


GAGAACCTGGTGTGCAAAG 


881 


SRC 


NM ,005417 


21361210 


6714 


D-0031 75-09 


GAGAGAACCTGGTGTGCAA 


882 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-10 


GGAGTTTGCTGGACTTTCT 


883 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-11 


GAAAGTGAGACCACGAAAG 


884 


SYK 














SYK 


NM 003177 


21361552 


6850 


D-0031 76-05 


GGAATAATCTCAAGAATCA 


885 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-06 


GAACTGGGCTCTGGTAATT 


886 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-07 


GGAAGAATCTGAGCAAATT 


887 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-08 


GAACAGACATGTCAAGGAT 


888 


TEC 














TEC --r 


NM 003215 


4507428 


7006 


D-0031 77-05 


GAAATTGTCTAGTAAGTGA 


889 


TEC 


NM 003215 


4607428 


7006 


D-0031 77-06 


CACCTGAAGTGTTTAATTA 


890 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-07 


GTACAAAGTCGCAATCAAA 


891 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 


892 


TEK - 














TEK 


NM 000459 


4557868 


7010 


D-003178rfl5 


GAAAGAATATGCCTCCAAA 


893- 


TEK 


NM 000459 


4557868 


7010 


D-003178-06 


GGAATGACATCAAATTTCA 


894 


TEK 


NM 000459 


4557868 


7010 


D-0031 78-07 


TGAAGTACGTGATATTCTA 


895 


TEK 


NM 000459 


4557868 


7010 


D-0031 78-08 


CGAAAGACCTACGTGAATA 


896 


TIE 














TIE 


NM 005424 


4885630 


7075 


D-0031 79-05 


GAGAGGAGGTTTATGTGAA 


897 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-06 


GGGACAGCCTCTACCCTTA 


898 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-07 


GAAGTTCTGTGCAAATTGG 


899 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-08 


CAACATGGCCTCAGAACTG 


900 


TNK1 














TNK1 


NM 003985 


4507610 


8711 


D-0031 80-05 


GTTCTGGGCCTAAGTCTAA 


901 


TNK1 


NM 003985 


4507610 


8711 


D-0031 80-06 


GAACTGGGTCTACAAGATC 


902 


TNK1 


NM 003985 


4507610 


8711 


D-0031 80-07 


CGAGAGGTATCGGTCATGA 


903 


1 INix 1 




4507610 


8711 


D-0031 80-08 


GGCGCATCCTGGAGCATTA 


904 


TXK 














TXK 


NM 003328 


4507742 


7294 


D-0031 81 -05 


GAACATCTATTGAGAGAAG 


905 


TXK 


NM 003328 


4507742 


7294 


D-0031 81 -06 


TCAAGGCACTTTATGATTT 


906 


TXK 


NM 003328 


4507742 


7294 


D-0031 81 -07 


GGAGAGGAATGGCTATATT 


907 


TXK 


NM 003328 


4507742 


7294 


D-0031 81-08 


GGATATATGTGAAGGAATG 


908 



113 



TYK2 














TYK2 


NM 003331 


4507748 


7297 


D-0031 82-05 


GAGGAGATCCACCACTTTA 


909 


TYK2 


NM 003331 


4507748 


7297 


D-0031 82-06 


GGATCCACATTGCACATAA 


910 


TYK2 


NM 003331 


4507748 


7297 


D-0031 82-07 


TGAAATACCTAGGCACACT 


911 


TYK2 


NM 003331 


4507748 


7297 


D-0031 82-08 


CAATCTTGCTGAGGTCTTG 


912 


TYR03 














TYR03 


NM 006293 


27597077 


7301 


D-0031 83-05 


GGTAGAAGGTGTGGCATTT 


913 


TYR03 


NM 006293 


27597077 


7301 


D-0031 83-06 


AGGCTGAGATTTAGAAGTA 


914 


TYR03 


NM 006293 


27597077 


7301 


D-0031 83-07 


GGATGGCTCCTTTGTGAAA 


915 


1 YKUo 


Nm 00o293 


27597077 


7301 


D-0031 83-08 


6AGA6GAACTACGAAGATC 


916 


YES1 














YES1 


NM 005433 


21071041 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 


917 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 


918 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-07 


TCAAGAAGCTCAGATAATG 


919 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-08 


CAGAATOCCTCCATGAATT 


920, 



Table Vra 



Gene 
Name 


Acc# 


Gl 


Locus 
Link 


Dunlex M 


run octjuonuc 




APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATCAA 




APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APG2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-Q03200-09 


CAACAC6TGTGACATCATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAACAAAT 


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAATGTTGCTTTCTGAATT 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCTGAAGTCTTATTTAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACACTGCTGGTTTG 


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


20143978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM 001184 


20143978 


545 


D-003202-08 


CAACATAAATCCAAGAAGA 


932 


BTAK - 














BTAK 


NM 003600 


3213196 


6790 


D-003545-04 


CAAAGAATCAGCTAGGAAA 


933 


BTAK 


NM 003600 


3213196 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


BTAK 


NM 003600 


3213196 


6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


935 


STK6 


NM 003600 


3213196 


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 


CCNA1 














CCNA1 


NM 003914 


16306528 


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-07 


TCACAAGAATGAGGTGTTA 


939 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-08 


CATAAAGCGTACCTTGATA 


940 


CCNA2 














CCNA2 


NM 001237 


16950653 


890 


D-003205-05 


GGTGTGAACTACATTGATA 


941 


CCNA2 


NM 001237 


16950653 


890 


D-003205-06 


GATGATAGCTACAGCAAGA 


942 


CCNA2 


NM 001237 


16950653 


890 


D-003205-07 


GCTGTTAGCCTCAAAGTTT 


943 


CCNA2 


NM 001237 


16950653 


890 


D-003205-08 


AAGCT6GCCTGAATCATTA 


944 


CCNB1 














CCNB1 


NM 031966 


14327895 


891 


D-003206-05 


CAACATTACCTGTGATATA 


945 


CCNB1 


NM 031966 


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT 


946 


CCNB1 


NM 031966 


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 



114 





KIM H'^iOAft 


14oz/^o95 


891 


D-003206-08 


GCACCTGGCTAAGAATGTA 


948 


















INIVI UUh/UI 


109oo017 


9133 


D-003207-05 


CAACAAATGTCAACAAACA 


949 




INIVI UU4/U1 


10938017 


9133 


D-003207-06 


GCAGCAAACTCCTGAAGAT 


950 


PPM DO 


INIVI UU4fU1 


'IAAOOA'4 ~r 

10938017 


9133 


D-003207-07 


CCAGTGATTTGGAGAATAT 


951 




MM (\(\A7n^ 

INIVI UU4/UI 


4 AAOO A4 ~/ 

10938017 


9133 


D-003207-08 


GTGACTACGTTAA6GATAT 


952 


















INIVI UOOUol 


14719419 


85417 


D-003208-05 


TGAACAAACTGCTGACTTT 


953 




INM U>5oUo1 


14719419 


85417 


D-003208-06 


GCTAGCTGCTGCCTCCTTA 


954 


PPMRO 


InM UooUoI 


14719419 


85417 


D-003208-07 


CAAGTCACCTCGTGTGGAT 


955 


PPMR** 


InM UooUoI 


A A^ A t\ A A f\ 

14719419 


85417 


D-003208-08 


GTGGATCTCTACCTAATGA 


956 


ppMp 














PPMP 


NM uuoiyo 


7382485 


892 


D-003209-05 


GCAGAGCTCCCACTATTTG 


957 




NM ouoiyo 


7382485 


892 


D-003209-06 


GGAGTAGTTTCAAATAGAA 


958 




InM UU5j9u 


7382485 


892 


D-003209-07 


GACCTTTGCTCCAGTATGT 


959 




NM 005190 


7382485 


892 


D-003209-08 


GAGATTCTATGCCAGGTAT 


960 


ppMn-i 














pPMn-i 


NM 053056 


16950654 


595 


D-003210-05 


TGAACAAGCTCAAGTGGAA 


961 




NM .053056 


16950654 


595 


D-003210-06 


CGAGAGTGATCAAGTGTGA 


962 




NM 053056 


16950654 


595 


D-003210-07 


GTTCGTGGCCTCTAAGATG 


963 


UOIMU1 


NM 053056 


16950654 


695 


D-003210-08 


CCGAGAAGGTGTGCATCTA 


964 


















NM_001759 


16950656 


894 


D-003211-06 


TGAATTACCTGGACCGTTT 


965 




NM 001759 


16950656 


894 


D-003211-07 


CGGAGAAGCTGTGCATTTA 


966 




NM 001769 


16950656 


894 


D-003211-08 


ctacagacgtgcgggatat 


967 




NM 001759 


16950656 


894 


D-003211-09 


CAACACAGAC6TGGATTGT 


968 


















NM 001760 


16950657 


896 


D-003212-05 


GGACCTGGCTGCTGTGATT 


969 




KIRA AAi4'7^n 

NM 001760 


16950657 


896 


D-003212-06 


GATTATACCTTTGCCATGT 


970 




NM 001760 


16950657 


896 


D-003212-07 


GAGCAGCACTCCTACAGAT 


971 




NM 001760 


16950657 


896 


D-003212-08 


TGCGGAAGATGCTGGCTTA 


972 


















NM 001238 


17318558 


898 


D-003213-05 


GTACTGAGCTGGGCAAATA 


973 


^wINt 1 


NM_001238 


17318558 


898 


D-003213-06 


GGAAATCTATCCTCCAAAG 


974 




NM 001238 


17318558 


898 


D«003213-07 


GGAGGTGTGTGAAGTCTAT 


975 




NM 001238 


17318558 


898 


D-003213-08 


CTAAATGACTTACATGAAG 


976 


















NM 057749 


17318564 


9134 


D-003214-05 


GGATQGAACTCATTATATT 


977 




NM 057749 


17318564 


9134 


D-003214-06 


GCAGATATGTTCATGACAA 


978 




NM_057749 


17318564 


9134 


D-003214-07 


CATAATATCCAGACACATA 


979 




NM 057749 


17318564 


9134 


D-003214-08 


TAAGAAAGCCTCAGGTTTG 


980 
















PPMP 


NM 001761 


4502620 


899 


D-003215-05 


TCACAAAGCATCCATATTG 


981 


PPMP 


INM UU1 /Ol 


4502620 


899 


D-003215-06 


GAAGTCATGTTTACAGTGT 


982 


PPMC 


NM 001761 


4502620 


899 


D-003215-07 


TAGCCTACCTCTACAATGA 


983 


PPMF 


NM 00l7o1 


4502620 


899 


D-003215-08 


GGACCGGGTTTATCAGTAA 


984 
















pp Mr^i 


Nm 004060 


8670528 


900 


D-003216-05 


GATAATGGCGTCAGAATGA 


985 


PPMf^l 


NM 0040oU 


8670528 


900 


D-003216-06 


GCAGGGCAATTGAAGCATA 


986 




INM 0040oU' 


8670528 


900 


D-003216-07 


GGAATAGAATGTCTTCAGA 


987 


CCNG1 


NM 004060 


8670528 


900 


D-00321fi-Oft 




988 


CCNG2 














CCNG2 


NM 004354 


4757935 


901 


D-G03217-05 


GGAGAGAGTTGGTTTCTAA 


989 


CCNG2 


NM 004354 


4757935 


901 


D-003217-06 


GGTGAAACGTAAACATTTG 


990 


CCNG2 


NM 004354 


4757935 


901 


D-003217-07 


GAAATACTGAGCCTTGATA 


991 


CCNG2 


NM 004354 


4757935 


901 


D-003217-08 


TGCCAAAGTTGAAGATTTA 


992 



115 



CCNH 














CCNH 


NM 001239 


17738313 


902 


D-003218-05 


GCTGATGACTTTCTTAATA 


993 


CCNH 


NM 001239 


17738313 


902 


D-003218-06 


CAACTTAATTTCCACCTTA 


994 


CCNH 


NM 001239 


17738313 


902 


D-003218-07 


ATACACACCTTCCCAAATT 


995 


CCNH 


NM 001239 


17738313 


902 


D-003218-08 


GCTATGAAGATGATGATTA 


996 


CCNI 














CCNI 


NM 006835 


17738314 


10983 


D-003219-05 


GCAAGCAGACCTCTACTAA 


997 


CCNI 


NM 006835 


17738314 


10983 


D-003219-07 


TGAGAGAATTCCAGTACTA 


998 


CCNI 


NM 006835 


17738314 


10983 


D-003219-08 


GGAATCAAACGGCTCTATA 


999 


CCNI 


NM 006835 


17738314 


10983 


D-003219-09 


GAATTGGGATCTTCACACA 


1000 


CCNT1 














CCNT1 


NM 001240 


17978465 


904 


D-003220-05 


TATCAACACTGCTATAGTA 


1001 


CCNT1 


NM 001240 


17978465 


904 


D-003220-06 


GAACAAACGTCCTGGTGAT 


1002 


CCNT1 


NM 001240 


17978465 


904 


D-003220-07 


GCACAAGACTCACCCATCT 


1003 


CCNT1 


NM 001240 


17978465 


904 


D-003220-08 


GCACAGACTTCTTACTTCA 


1004 


CCNT2A 














CCNT2A 


NM 001241 


17978467 


905 


D-003221-05 


GCACAGACATCCTAT7TCA 


1005 


CCNT2A 


NM 001241 


17978467 


905 


D-003221-06 


GCAGGGACCTTCTATATCA 


1006 


CCNT2A 


NM 001241 


,17978467 


905 


D-003221-07 


GAACAGCTATATTCACAGA 


1007 


CCNT2A 


NM 00124t 


17978467 


905 


D-003221-09 


TTATATAGCTGCCCAGGTA 


1008 


CCNT2B 














CCNT2B 


NM 058241 


17978468 


905 


D-003222-05 


GCACAGACATCCTATTTCA 


1009 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-06 


GCAGGGACCTTCTATATCA 


1010 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-07 


GAACAGCTATATTCACAGA 


1011 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-08 


GGTGAAATGTACCCAGTTA 


1012 


CDC16 














CDC16 


NM 003903 


14110370 


8881 


D-003223-05 


GTAGATGGCTTGCAAGAGA 


1013 


CDC16 


NM 003903 


14110370 


8881 


D-003223-06 


TAAAGTAGCTTCACTCTCT 


1014 


CDC16 


NM 003903 


14110370 


8881 


D-003223-07 


GCTACAAGCTTACTTCTGT 


1015 


CDC16 


NM_003903 


14110370 


8881 


D-003223^8 


TGGAAGAGCCCATCAATAA 


1016 


CDC2 














CDC2 


NM 033379 


27886643 


983 


D-003552-01 


GTACAGATCTCCAGAAGTA 


1017 


CDC2 


NM 033379; 


27886643 


983 


D-003552-02 


GATCAACTCTTCAGGATTT 


1018 


CDC2 


NM 033379 


27886643 


983 


D-003552-03 


GGTTATATCTCATCTTTGA 


1019 


CDC2 


NM 033379 


27886643 


983 


D-003552-04 


GAACTTCGTCATCCAAATA 


1020 


CDC20 














CDC20 


NM 001255 


4557436 


991 


D-003225-05 


GGGAATATATATCCTCTGT 


1021 


CDC20 


NM 001255 


4557436 


991 


D-003225-06 


GAAACGGCTTCGAAATATG 


1022 


CDC20 


NM 001255 


4557436 


991 


D-003225-07 


GAAGACCTGCCGTTACATT 


1023 


CDC20 


NM 001255 


4557436 


991 


O-003225-08 


CACCAGTGATCGACACATT 


1024 


CDC25A 














CDC25A 


NM 001789 


4502704 


993 


D-003226-05 


GAAATTATGGCATCTGTTT 


1025 


CDC25A 


NM. 001789 


4502704 


993 


D-003226-06 


TACAAGGAGTTCTTTATGA 


1026 


CDC25A 


NM 001789 


4502704 


993 


D-003226-07 


CCACGAGGACTTTAAAGAA 


1027 


CDC25A 


NM 001789 


4502704 


993 


D-003226-08 


TGGGAAACATCAGGATTTA 


1028 


CDC25B 














CDC25B 


NM 004358 


11641416 


994 


D-003227-05 


GCAGATACCCCTATGAATA 


1029 


CDC25B 


NM 004358 


11641416 


994 


D-003227-06 


CTAGGTCGCTTCTCTCTGA 


1030 


CDC25B 


NM 004358 


11641416 


994 


D-003227-07 


GAGAGCTGATTGGAGATTA 


1031 


CDC25B 


NM 004358 


11641416 


994 


D-003227-08 


AAAAGGACCTCGTCATGTA 


1032 
















CDC25C 


NM 001790 


12408659 


995 


D-003228-05 


GAGCAG AAGTGG CCTATAT 


1033 


CDC25C 


NM 001790 


12408659 


995 


D-003228-06 


CAGAAGAGATTTCAGATGA 


1034 


CDC25C 


NM 001790 


12408659 


995 


D-003228-07 


CCAGGGAGCCTTAAACTTA 


1035 


CDC25C 


NM 001790 


12408659 


995 


D-003228-08 


GAAACTTGGTGGACAGTGA 


1036 


CDC27 















116 



ODC27 


NM 001256 


16554576 


996 


D-O03229-06 


CATGCAAGCTGAAAGAATA 


1037 


CDC27 


NM 001256 


16554576 


996 


D-003229-07 


CAACACAAGTACGTAATCA 


1038 


OL)027 


NM 001256 


16554576 


996 


D-003229-08 


GGAGATGGATCCTATTTAC 


1039 




NM 001256 


16554576 


996 


D-003229-09 


GAAAAGCCATGATGATATT 


1040 


CDC34 














CDC34 


NM 004359 


16357476 


997 


D-003230-05 


GCTCAGACCTCTTCTACGA 


1041 


CDC34 


NM 004359 


16357476 


997 


D-003230-06 


GGACGAGGGCGATCTATAC 


1042 


CDC34 


NM 004359 


16357476 


997 


D-003230-07 


GATCGGGAGTACACAGACA 


1043 


CDC34 


NM 004359 


16357476 


997 


D-003230-08 


TGAACGAGCCCAACACCTT 


1044 


CDC37 














CDC37 


NM 007065 


16357478 


11140 


D-003231-05 


GCGAGGAGACAGGCAATTA 


1045 


CDC37 


NM 007065 


16357478 


11140 


D-003231-06 


CACAAGACCTTCGTGGAAA 


1046 


CDC37 


NM 007065 


16357478 


11140 


D-003231-07 


ACAATCGTCATGCAATTTA 


1047 


CDC37 


NM 007065 


16357478 


11140 


D-003231-08 


GAGGAGAAATGTGCACTCA 


1048 


CDC45L 














CDC45L 


NM 003504 


34335230 


8318 


D-003232-05 


GGACACGGATCTCCTTTGA 


1049 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-06 


GCAAACACCTGCTCAAGTC 


1050 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-07 


TGAAGAGTCTGCAAATAAA 


1051 


CDC45L . 


NM 003504 


34335230 


8318 


D-003232-08 


GGACGTGGATGCTCTGTGT 


1052 


CDC6 










- 




CDC6 


NM 001254 


16357469 


990 


D-003233-05 


GAACACAGCTGTCCCAGAT 


1053 


CDC6 


NM 001254 


16357469 


990 


D-003233-06 


GAGCAGAGATGTCCACTGA 


1054 


CDC6 


NM 001254 


16357469 


990 


D-003233-07 


GGAAATATCTTAGCTACTG 


1055 


CDC6 


NM 001254 


16357469 


990 


D-003233-08 


GGACGAAGATTGGTATTTG 


1056 


CDC7 














CDC7 


NM 003603 


11038647 


8317 


D-003234-05 


GGAATGAGGTACCTGATGA 


1057 


CDC7 


NM 003503 


11038647 


8317 


D-003234-06 


CAGGAAAGGTGTTCACAAA 


1058 


CDC7 


NM 003503 


11038647 


8317 


D-003234-07 


CTACACAAATGCACAAATT 


1059 


CDC7 


NM 003503 


11038647 


8317 


D-003234-08 


GTACG6GAATATATGCTTA 


1060 


CDK10 














CDK10 


NM 003674 


32528262 


8558 


D-003235-05 


GAACTGCTGTTGGGAACCA 


1061 


CDK10 


NM 003674 


32528262 


8558 


D-003235-06 


GGAAGCAGCCCTACAACAA 


1062 


CDK10 


NM 003674 


32628262 


8558 


D-003235-07 


GCACGCCCAGTGAGAACAT 


1063 


CDK10 


NM 003674 


32528262 


8558 


D-003235-08 


GGAAGCAGCCCTACAACAA 


1064 


CDK2 














CDK2 


NM 001798 


16936527 


1017 


D-003236-05 


GAGCTTAACCATCCTAATA 


1065 


CDK2 


NM 001798 


16936527 


1017 


D-003236-06 


GAGCTTAACCATCCTAATA 


1066 


CDK2 


NM 001798 


16936527 


1017 


D-003236-O7 


GTACCGAGCICCTGAAATC 


1067 


CDK2 


NM 001798 


16936527 


1017 


D-003236-08 


GAGAGGTGGTGGCGCTTAA 


1068 


CDK3 














CDK3 


NM 001258 


4557438 


1018 


D-003237-05 


GAGCATTGGTTGCATCTTT 


1069 


CDK3 


NM 001258 


4557438 


1018 


D-003237-06 


GATCGGAGAGGGCACCTAT 


1070 


CDK3 


NM 001258 


4557438 


1018 


D-003237-07 


GAAGCTCTATCTGGTGTTT 


1071 


CDK3 


NM 001258 


4557438 


1018 


D-003237-08 


GCAGAGATGGTGACTCGAA 


1072 


CDK4 














CDK4 


NM 000075 


456426 


1019 


D-003238-05 


GCAGCACTCTTATCTACAT 


1073 


CDK4 


NM 000075 


456426 


1019 


D-003238-06 


GGAGGAGGCCTTCCCATCA 


1074 


CuK4 


NM 000075 


456426 


1019 


D-003238-07 


TCGAAAGCCTCTCTTCTGT 


1075 


CDK4 


NM _000075 


456426 


1019 


D-003238-08 


GTACCGAGCTCCCGAAGTT 


1076 


CDK5 














CDK5 


KIM ^^4Q'^'^ 

INI VI v/Viti/OO 


*rO^OO ( H 




U-UUO/toVMJO 


I GjACCAAGCTGCCAG ACTA 


1077 


CDK5 


NM 004935 


4826674 


1020 


D-003239-06 


GAGCTGAAATTGGCTGATT 


1078 


CDK5 


NM 004935 


4826674 


1020 


D-003239-07 


CAACATCCCTGGTGAACGT 


1079 


CDK5 


NM 004935 


4826674 


1020 


0-003239-08 


GGATTCCCGTCCGCTGTTA 


1080 


CDK6 














CDK6 


NM 001259 


16950658 


1021 


D-003240-05 


GCAAAGACCTACTTCTGAA 


1081 



117 



CDK6 


NM 0017'^Q 

I^IVI V/v/ 1 ^Ov7 


1 Uv70UUuO 


1 UZ1 


U-003240-06 


GAAGAAGACTGGCCTAGAG 


1082 


CDK6 




lOUOUODo 


^ AOH 

lUZI 


D-003240-07 


GGTCTGGACTTTCTTCATT 


1083 


CDK6 




1 QslOUDOo 


i AOi 

1UZ1 


D-003240-08 


TAACAGATATCGATGAACT 


1084 


CDK7 














CDK7 


NM 001 7QQ 


1 oy ouooy 


A AOO 

lUZz 


D-003241-05 


GGACATAGATCAGAAGCTA 


1085 


CDK7 


NM 001 7QQ 


loyouDoy 


i AOO 

lUZz 


D-003241-06 


CAATAGAGCTTATACACAT 


1086 


CDK7 


MM 001 7QQ 


1 oyouooy 


<f AOO 

1022 


D-003241~07 


CATACAAGGCTTATTCTTA 


1087 


CDK7 


NM 001 7QQ 


1 oyouooy 


i AOO 

1022 


D-003241 -08 


GGAGACGACTTACTAGATC 


1088 


CDK8 
















IN IVI UU 1 zou 


4oUZf 44 


1024 


D-003242-05 


CCACAGTAGTCACATCAGA 


1089 


CDK8 


NM nni9Rn 

IN IVI UU 1 ^DU 


40UZ / 44 


1024 


D-003242-06 


GCAATAACCACACTAATGG 


1090 




NM nniofin 

IN IVI UU 1 zou 


4oUz / 44 


1024 


D-003242-07 


GAAGAAAGTGAGAGTTGTT 


1091 


CDKft 


IN IVI UU 1 ZDU 


4oUZf 44 


1024 


D-003242-08 


GAACATGACCTCTGGCATA 


1092 
















CDKQ 


IN IVI UU IZ01 


1 TA-I 7AOO 


1025 


D-003243-05 


GGCCAAACGTGGACAACTA 


1093 


PDKQ 


IN IVI UU1ZOI 


•i 7r\^ "700 0 


1025 


D-003243-06 


TGACGTCCATGTTCGAGTA 


1094 


PHKQ 


KIM nniOAi 


1 /Ul 7983 


1025 


D-003243-07 


CCAACCAGACGGAGTTTGA 


1095 


nnKQ 


K|M nniOfti 

INIVI UUIilOl 


1/017983 


1025 


D-003243-08 


GAAGGTGGCTCTGAAGAAG 


1096 


r;nKMip 

^L/fxIN 1 O 
















INIVI UUUUfO 


AtLCtA At\ 

45O7440 


1028 


D-003244-05 


GACCAGAACCGCTGGGATT 


1097 




rliVl UUUUfD 


4557440 


1028 


D-003244-06 


GGACCGAAGTGGACAGCGA 


1098 


PDKWIP 


INIVI UUUUfO 


4557440 


1028 


D-003244-08 


GCAAGAGATCAGCGCCTGA 


1099 


PHKMir^ 

v^UrVIN 1 0 


iNm UUUu7d 


4557440 


1028 


D-003244-09 


CCGCTGGGATTACGACTTC 


1100 


















iMM UU4yoD 


17981693 


1030 


D-003245-05 


GCGAGGAGAACAAGGGCAT 


1101 




InM UU4yoO 


17981693 


1030 


D-003245-06 


CCAACGGAGTCAACCGTTT 


1102 




IN IVI 


17981693 


1030 


D-^03245-07 


CGATCCAGGTCATGATGAT 


1103 




Nivl UU49oo 


17981693 


1030 


D-003245-08 


CCTGGAAGCCGGCGCGGAT 


1104 


















INIVI UUlzoZ 


17981697 


1031 


D-003246-05 


GGACACCGCCTGTGATTTG 


1105 




INIVI UU1 ZDZ 


A ^i\r%A 

17981697 


1031 


D-003246-06 


GCCAGGAGACTGCTACTTA 


1106 




KttiA Art-IOC) 
INm UUIZDZ 


1 7981 697 


1031 


D-003246>07 


TGAAAGACCGAACTGGTTT 


1107 


pnKM9p 


INIVI UUizo^: 


17981^7 


1031 


D-003246-08 


GAACCTGCCCTTGCACTTG 


1108 


















KIK^ AA-IQAA 
iNm UUloUU 


1 7981 700 


1032 


D-003247-05 


TGGCAGTTCAAGAGGGTCA 


1109 




INIVI UUlOUU 


1 / 981 700 


1032 


D-003247-06 


CTCAGGACCTCGTGGACAT 


1110 




NM nni AAA 
INIVI_UU 1 oUU 


l/SolvOO 


1032 


D-003247-07 


TGAAGGTCCTAGTGGAGCA 


1111 




NM nni AAA 

INIVI_UU 1 oUU 


A TOO A T/^n 

•1 7981 700 


1032 


D-003247-08 


AGACGGCGCTGCAQGTCAT 


1112 


CDTI 














PDTI 


KIKyi A1AA9Q 

IN IVI uouyzo 


19923847 


81620 


D-003248-05 


CCAAGGAGGCACAGAAGCA 


1113 


CDTI 


MM AQAQOQ 

INIVI uouyzcJ 


19923847 


81620 


D-003248-06 


GCTTCAACGTGGATGAAGT 


1114 


1 1 


MM A'^AOOA 

iNiw uouyzo 


1 9923847 


81620 


D-003248-07 


TCTCCGGGCCAGAAGATAA 


1115 


CDTI 


MM A'^QOA 
INIVI UOUyZo 


1992oo47 


81620 


D-003248-08 


GCGCAATGTTGGCCAGATC 


1116 


CEWPA 














CFNPA 


MM nniAno 

INIVI UU lOUy 


4OOOOOI 


1058 


D-003249-05 


GCAGACACCTCTTGATAAG 


1117 


CFNPA 


NM OniAAO 
INIVI UU lOUy 


400000 1 


A Am 

1058 


D-003249-06 


GCAAGAGAAATATGTGTTA 


1118 


CENPA 


NM nniAAQ 

INIVI UUIoUy 


4OOOOOI 


1058 


D-003249-07 


TTACATGCAGGCCGAGTTA 


1119 


CFNPA 


MM OniAAQ 
IMIVI UU 1 OUv7 


4000001 


1058 


D-003249-08 


GAGACAAGGTTGGCTAAAG 


1120 


CFNPR 














CFNPR 


NM nniftiA 
INIVI UU loiU 


ZO 105977 


1059 


D-003250-05 


GGACATAGCCGCGTGCTTT 


1121 


CENPB 


NM 001810 


26105977 


1059 


D-003250-06 


GCACttATPPTnAAriAAPAA 




CENPB 


NM 001810 


26105977 


1059 


D-003250-07 


GGAGGAGGGTGATGTTGAT 


1123 


CENPB 


NM 001810 


26105977 


1059 


D-003250-08 


CCGAATGGCTGCAGAGTCT 


1124 


CENPC1 














CENPC1 


NM 001812 


4502778 


1060 


D-003251-05 


GCGAATAGATTATCAAGGA 


1125 


CENPC1 


NM 001812 


4502778 


1060 


D-003251-06 


GAACAGAATCCATCACAAA 


1126 



118 





NM UOI0I2 


4502778 


1060 


D-003251-07 


CCATAAACCTCACCCAGTA 


1127 




Klh/I rtA'IO'lO 

InM UUIolZ 


4502778 


1060 


D-003251-08 


CAAGAGAACACGTTTGAAA 


1128 


LybNrb 














UcNr't 


NM OOI0I3 


4502780 


1062 


D-003252-05 


GAAGACAGCTCAAATAATA 


1129 




NM 001813 


4502780 


1062 


D-003252-06 


CAACAAAGCTACTAAATCA 


1130 


LrbNrb 


NM 001813 


4502780 


1062 


D-003252-07 


GGAAAGAAGTGCTACCATA 


1131 


ubNrc 


NM 001813 


4502780 


1062 


D-003252-08 


GGAAAGAAATGACACAGTT 


1132 


UbNrr 














CtNPF 


NM 016343 


14670380 


1063 


D-003253-05 


GCGAATATCTGAATTAGAA 


1133 


OcNPF 


NM 016343 


14670380 


1063 


D-003253-06 


GGAAATTAATGCATCCTTA 


1134 


CENPF 


NM 016343 


14670380 


1063 


D-003253-07 


GAGCGAGGCTGGTGGTTTA 


1135 


GENPF 


NM 016343 


14670380 


1063 


D-003253-08 


CAAGTCATCTTTCATCTAA 


1136 


CtNPH 














CENPH 


NM 022909 


21264590 


64946 


D-003254-05 


GAAAGAAGAGATTGCAATT 


1137 


CENPH 


NM 022909 


21264590 


64946 


D-003254-06 


CAGAACAAATTATGCAAGA 


1138 


CENPH 


NM 022909 


21264590 


64946 


D-003254-07 


CTAGTGTGCTCATGGATAA 


1139 


CENPH 


NM 022909 


21264590 


64946 


D.003254-08 


GAAACACCTATTAGAGCTA 


1140 


CHEK1 














CHEK1 


NM 001274 


20127419 


1111 


^003255-05 


CAAATTGGATGCAGACAAA 


1141 


CHEK1 


NM 001274 


20127419 


1111 


D-003255-06 


GCAAGAGTATTTCGGTATA 


1142 


CHEK1 


NM 001274 


20127419 


1111 


D-003255-07 


GGACTTCTCTCCAGTAAAC 


1143 


CHEK1 


NM 001274 


20127419 


1111 


D-003255-08 


AAAGATAGATGGTACAACA 


1144 


CHEK2 














CHEK2 


NM 007194 


22209010 


11200 


D-003256-02 


CTCTTACATTGCATACATA 


1145 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-03 


TAAACGGGTGAAAGAAGCT 


1146 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-04 


GCATAGGACTCAAGTGTCA 


1147 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-05 


GAAATTGCACTGTCACTAA 


1148 


CNK 














CNK 


NM 004073 


4758015 


1263 


D-003257-05 


GCGAGAAGATCCTAAAT6A 


1149 


CNK 


NM 004073 


4758015 


1263 


D-003257-07 


GCAAGTGGGTTGACTACTC 


1150 


CNK 


NM 004073 


4758015 


1263 


D-003257-08 


GCAGATCCGTTGGCCATCA 


1151 


CNK 


NM 004073 


4758015 


1263 


D-003257«09 


GACCTCAAGTTGGGAAATT 


1152 


CRI1 














CRI1 


NM 014335 


7656937 


23741 


D-003258-05 


GTGATGAGATTATTGATAG 


1153 


CRI1 


NM 014335 


7656937 


23741 


D-003258-06 


GGACGAGGGCGAGGAATTT 


1154 


CKI1 


NM 014335 


7656937 


23741 


D-003258-07 


GGAAAGGGAGCCTTGCTAA 


1155 


CRM 


NM 014335 


7656937 


23741 


D-003258-08 


TCAATCGTCTGACGGAAGA 


1156 


E2F1 










. . 




E2F1 


NM 005225 


12669910 


1869 


D-003259-05 


GAACAGGGGCACTGACTOT 


1157 


E2F1 


NM 005225 


1i2669910 


1869 


D-003259-06 


TGGACCACCTGATGAATAT 


1158 


n2F1 


NM 005225 


12669910 


1869 


D-003259-07 


CCCAGGAGGTCACTTCTGA 


1159 


E2F1 


NM 005225 


12669910 


1869 


D-003259-08 


GGCTGGACCTGGAAACTGA 


1160 


E2F2 














t2F2 


NM 004091 


34485718 


1870 


D-003260-05 


GGGAGAAGACTCGGTATGA 


1161 


bZFZ 


NM 004091 


34485718 


1870 


D-003260-06 


GAGGAGAACCTGCAGATAT 


1162 


coco 
c2F2 


NM 004091 


3448571 8 


1870 


D-003260-07 


TGAAGGAGGTGATGAACAC 


1163 


c2F2 


NM 004091 


34485718 


1870 


D-003260-08 


CGAAGAAGTTCATTTACGT 


1164 


b2F3 














coco 

c2F3 


NM 001949 


12669913 


1871 


D-003261-05 


GAAATTAGATGAACTGATC 


1165 


coco 
b2F3 


NM 001949 


12669913 


1871 


D-003261-06 


TGAAGTGCGTGACTCAATA 


1166 


E2F3 


NM 001 Q4Q 




lOf 1 




oAAUAAtitpUAGCAGAAGTG 


1167 


E2F3 


NM 001949 


12669913 


1871 


D-003261^8 


GAAACACACAGTCCAATGA 


1168 


E2F4 














E2F4 


NM 001950 


12669914 


1874 


D-003262-05 


GGAGATT6CTGACAAAGTG 


1169 


E2F4 


NM 001950 


12669914 


1874 


D-003262-06 


GAAGGTATCGGGCTAATCG 


1170 


E2F4 


NM 001950 


12669914 


1874 


D-003262-07 


GTGCAGAAGTCCAGGGAAT 


1171 



119 



E2F4 


NM 001950 


12669914 


1874 


D-003262-08 


GGACAGTGGTGAGCTCAGT 


1172 


E2F5 














E2F5 


NM 001951 


12669916 


1875 


D-003263-05 


GCAGATGACTACAACTTTA 


1173 


E2F5 


NM 001951 


12669916 


1875 


D-003263-06 


GACATCAGCTACAGATATA 


1174 


E2F5 


NM 001951 


12669916 


1875 


D-003263-07 


CAACATGTCTCTGAAAGAA 


1175 


E2F5 


NM 001951 


12669916 


1875 


D-003263-08 


GAAGACATCTGTAATTGCT 


1176 


E2F6 














E2F6 


NM 001952 


12669917 


1876 


D-003264-05 


TAAACAAGGTTGCAACGAA 


1177 


E2F6 


NM 001952 


12669917 


1876 


0-003264-06 


TAGCATATGTGACCTATCA 


1178 


E2F6 


NM 001952 


12669917 


1876 


D-003264-07 


GAAACCAGATTGGATGTTC 


1179 


E2F6 


NM 001952 


12669917 


1876 


D-003264-09 


GGAACTTTCTGACTTATCA 


1180 


FOS 














FOS 


NM 005252 


6552332 


2353 


D-003265-05 


GGGATAGCCTCTCTTACTA 


1181 


FOS 


NM 005252 


6552332 


2353 


D-003265-06 


GAACAGTTATCTCCAGAAG 


1182 


FOS 


NM 005252 


6552332 


2353 


D-003265-07 


GGAGACAGACCAAGTAGAA 


1183 


FOS 


NM 005252 


6552332 


2353 


D-003265-08 


AGACCGAGCCCTTTGATGA 


1184 


HIPK2 














HIPK2 


NM 022740 


13430859 


28996 


D-003266-06 


GAGAATCACTCCAATCGAA 


1185 


HIPK2 


NM 022740 


13430859 


28996 


D-003266-07 


AGACAGGGATTAAGTCAAA 


1186 


HIPK2 


NM 022740 


13430859 


28996 


D-003266-08 


GGACAAAGACAACTAGGTT 


1187 


HIPK2 


NM_022740 


13430859 


28996 


D-003266-09 


GCACACACGTCAAATCATG 


1188 


HUS1 














HUS1 


NM 004507 


31077213 


3364 


D-003267-05 


ACAAAGGCCTTATGCAATA 


1189 


HUS1 


NM 004507 


31077213 


3364 


D-003267-06 


GAAGTGCACATAGATATTA 


1190 


HUS1 


NM 004507 


31077213 


3364 


D-003267^7 


AAGCTTAACTTCATCCTn- 


1191 


HUS1 


NM 004507 


31077213 


3364 


D-003267-08 


GAACTTGTrCAACGAATTT 


1192 


JUN 














JUN 


NM 002228 


7710122 


3725 


D-003268-05 


TGGAAACGAGCTTGTATGA 


1193 


JUN 


NM 002228 


7710122 


3725 


D-003268-06 


GAACTGCACAGCCAGAACA 


1194 


JUN 


NM 002228 


7710122 


3725 


D-003268-07 


GAGCTGGAGCGCCTGATAA 


1195 


JUN 


NM 002228 


7710122 


3725 


D-003268-08 


TAACGCAGCAGTTGCAAAG 


1196 


JUNB 














JUNB 


NM 002229 


4504808 


3726 


D-003269-05 


GCATCAAAGTGGAGCGCAA 


1197 


JUNB 


NM 002229 


4504808 


3726 


D-003269-06 


TGGAAGACCAAGAGCGCAT 


1198 


JUNB 


NM 002229 


4504808 


3726 


D-003269-07 


CATACACAGCTACGGGATA 


1199 


JUNB 


NM 002229 


4504808 


3726 


D-003269-08 


CCATCAACATGGAAGACCA 


1200 


LOC510 

53 














LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-05 


ggagaaaggcgctgTatga 


1201 


LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-06 


GAATAGTTCTGTCGCAAGA 


1202 


LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-07 


gaacatgtagagtatatgg 


1203 


LOC510 

53 


NM_015895 


20127542 


51053 


D-003270-08 


gcagaaacaagaagaaatc 


1204 


MAD2L1 














MAD2L1 


NM 002358 


6466452 


4085 


D-003271-05 


gaaagatggcagtttgata 


1205 


MAD2L1 


NM 002358 


6466452 


4085 


D-003271-06 


taaataatgtggtggaaca 


1206 


MAD2L1 


NM 002358 


6466452 


4085 


P-003271-07 


GAAATCCGTTCAGTGATCA 


1207 


MAD2L1 


NM 002358 


6466452 


4085 


D-003271-08 


ttactcgagtgcagaaata 


1208 


IVIAD2L2 














MAD2L2 


NM 006341 


6006019 


10459 


D-003272-05 


GGAAGAGCGCGGTCATAAA 


1209 


MAD2L2 


NM 006341 


6006019 


10459 


D-003272-06 


TGGAAGAGCGCGCTCATAA 


1210 


MAD2L2 


NM 006341 


6006019 


10459 


D-003272-07 


AGCCACTCCTGGAGAAGAA 


1211 


MAD2I-2 


NM 006341 


6006019 


10459 


D-003272-08 


TGGAGAAATTCGTCTTTGA 


1212 


MCM2 















120 



MPM9 


IN IVl UU^OiCD 


00000040 


4171 


D-003273-05 


GAAGATCTTTGCCAGCATT 


1213 


MPM9 


IN IVl UuhO^D 


0000004D 


4171 


D-003273-06 


GGATAAGGCTCGTCAGATC 


1214 




INM UU*#O^D 


ooooDo4b 


4171 


D-003273-07 


CAGAGCAGGTGACATATCA 


1215 


MPM9 


MM nnAMd 

INIVI 


00006546 


>l A ^A 

4171 


D-003273-08 


GCCGTGGGCTCCT6TATGA 


1216 
















lVl^^lVlO 


INm UUZooo 


OOOOD04o 


4172 


D-003274-05 


GGACATGAATATTCTTCTA 


1217 




INlVI UU^OOo 


oooo654o 


A A ^O 

4172 


D-003274-06 


GCCAGGACATCTCCAGTTA 


1218 


mUlVIo 


NM UUzooo 


33356548 


41 72 


D-003274-07 


GCAGGTATGACCAGTATAA 


1219 




Nivl UUzooo 


o335654o 


A A "VO 

4172 


D-003274-08 


GGAAATGCCTCAAGTACAC 


1220 


















AM UoOii/4 


22047061 


4173 


D-003275-05 


GGACATATCTATTCTTACT 


1221 




AM Uouzr4 


22047061 


4173 


D-003275-06 


GATGTTAGTTCACCACTGA 


1222 




aM 030274 


oort ii ^^^^ 
22047061 


4173 


D-003275-07 


CCAGCTGCCTCATACTTTA 


1223 


MPIV/?>1 
IVIL/IVl*!" 


AM OoOii/4 


22047061 


4173 


D-003275-08 


GAAAGTACAAGATCGGTAT 


1224 


IVIL»lVlO 














IVIUlviO 


NM 006739 


2351 0447 


4174 


D-003276-05 


GAAGATCCCTGGCATCATC 


1225 


ML/lVlO 


IMM 006739 


23510447 


4M4 


D-003276-06 


GAACAGGGTTACCATCATG 


1226 


IVlL/MO 


NM 006739 


23510447 


AM A 


D-003276-07 


GGACAACATTGACTTGATG 


1227 




NM 00o739 


2351 0447 


AM A 


D-003276-08 


CCAAGGAG6TAGCTGATGA 


1228 


IVIUMd 














MUiVlO 


NM 005915 


33469920 


4175 


D-003277-05 


GGAAAGAGCTCAGAGATGA 


1229 


MUlvlb 


NM 005915 


33469920 


4175 


D-003277-06 


GAGCAGCGATGGAGAAATT 


1230 


MoMo 


NM 005915 


33469920 


4175 


D-003277«07 


GGAAACACCTGATGTCAAT 


1231 


ft fl/^R flO 

MLrMo 


NM 005915 


33469920 


4175 


D-003277-08 


CCAAACATCTGCCGAAATC 


1232 
















MOM 7 


NM 005916 


33469967 


4176 


D-003278-05 


GGAAATATCCCTCGTAGTA 


1233 




NM 005916 


33469967 


4176 


D-003278-06 


GGAAGAAGCAGTTCAAGTA 


1234 


MOM/ 


NM 005916 


33469967 


4176 


D-003278-07 


CAACAAGCCTCGTGTGATG 


1235 


ft A/^n 


NM 005916 


33469967 


4176 


D-003278-08 


GGAGAGAACACAAGGATTG 


1236 


MUMz: 














fti»r\(v flo 


NM 002392 


4505136 


4193 


I>003279-05 


GGAGATATGTTGTGAAAGA 


1237 


ft Ar^EV/io 
MUlviZ 


NM 002392 


4505136 


4193 


D-003279-06 


CCACAAATCTGATAGTATT 


1238 


ivlUMz 


NM 002392 


4505136 


4193 


B-003279-07 


GATGAGGTATATCAAGTTA 


1239 


iviUiviz 


NM 002392 


4505136 


4193 


D-003279-08 


GGAAGAAACCCAAGACAAA 


1240 


IVIBMD/ 














IVIIMO/ 


NM 002417 


19923216 


4288 


D-003280-05 


GCACAAAGCTTGGTTATAA 


1241 




NM_002417 


19923216 


4288 ^ 


D-003280-06 


CGTAAGACCTGAACTATTT 


1242 


IVllMDf 


NM 002417 


1 992321 6 


4288 


D-003280-07 


CAAAGAGGAACACAAATTA. 


1243 


IVIIXlOf 


NM 002417 


1 992321 6 


4288 


D-003280-08 


GTAAATGGGTCTGTTATTG ^ 


1244 


IVIInA I 1 














IVUNMI 1 


NM 002431 


4505224 


4331 


D-003281-05 


GGAAGAAGCTTTAGAAGTG 


1245 


IVIlNAI 1 


NM 002431 


4505224 


4331 


D-003281-06 


TAGATGAGCTGGAGAGTTC 


1246 


SV/IMAT-I 
iVilNM 1 1 


NM 002431 


4505224 


4331 


D-003281-07 


GGACCTTGCTGGAGGCTAT 


1247 


MMAT1 
IVIINAA I 1 


NM 002431 


4505224 


4331 


D-003281-08 


GCAGATAGAGACATATGGA 


1248 


iVI YLr 














R/IVP 
iVl YL/ 


KIAA AAOvl^T 

NM 002467 


31543215 


4609 


D-003282-05 


CAGAGAAGCTGGGCTCCTA 


1249 


IVl YL/ 


KIKil AAOylC7 

NM 002467 


31 54321 5 


4609 


D-003282-06 


GAAACGACGAGAACAGTTG 


1250 


IVl YO 


NM 0024o7 


31543215 


4609 


D-003282-07 


CGACGAGACCTTCATCAAA 


1251 


IV! YO 


NM 002467 


tr A 004 tT 

31643215 


4609 


D-003282-08 


CCACACATCAGCACAACTA 


1252 
















ORC1L 


NM 004153 


31795543 


4998 




(ti A A P A /^(^ A A TTP P A A P A A 




0RC1L 


NM 004153 


31795543 


4998 


D~003283-06 


TAAGAAACGTGCTCGAGTA 


1254 


ORC1L 


NM 004153 


31795543 


4998 


D-003283-07 


GAGATCACCTCACCTTCTA 


1255 


ORC1L 


NM 004153 


31795543 


4998 


D-003283-08 


GCAGAGAGCCCTTCTTGGA 


1256 


ORC2L 














ORC2L 


NM 006190 


32454751 


4999 


D-003284-05 


GAAGAAAGGTCCTATGAGA 


1257 



121 



0RC2L 


NM 006190 


32454751 


4999 


D-003284-06 


GAAGGGAACTGATGGAGTA 


1258 


0RC2L 


NM 006190 


32454751 


4999 


D-003284-07 


GAAGAATGATCCTGAGATT 


1259 


ORC2L 


NM 006190 


32454751 


4999 


D-003284-08 


GAAGAGATGTTCAAGAATC 


1260 


ORC3L 














ORC3L 


NM 012381 


32483366 


23595 


D-003285-05 


GGAGTGCTGTGTAGATATA 


1261 


0RC3L 


NM 012381 


32483366 


23595 


D-003285-06 


GAACTGATGACCATACTTG 


1262 


0RC3L 


NM 012381 


32483366 


23595 


D-003285-07 


AAAGATCTCTCTGCCAATA 


1263 


0RC3L 


NM 012381 


32483366 


23595 


D-003285-08 


CAGCACAGCTAAGAGAATA 


1264 


ORC4L 














ORC4L 


NM 002552 


32454749 


5000 


D-003286-06 


GAAAGGACATTCCGTTTAT 


1265 


ORC4L 


NM 002552 


32454749 


5000 


D-003286-07 


TGAAAGAACTCATGGAAAT 


1266 


ORC4L 


NM 002552 


32454749 


5000 


D-003286-08 


GCTGAGAAGTGGAATGAAA 


1267 


ORC4L 


NM 002552 


32454749 


5000 


D-003286-09 


CCAGTGATCTTCATATTAG 


1268 


ORC5L 














ORC5L 


NM 002553 


32454752 


5001 


D-003287-05 


GAAATAACCTGTGAAACAT 


1269 


ORC5L 


NM 002553 


32454752 


5001 


D-003287-06 


CAGATTAGCTCTGTAGTGA 


1270 


ORC5L 


NM 002553 


32454752 


5001 


D-003287-07 


GAACTTCCATATTACTCTA 


1271 


0RC5L 


NM 002553 


32454752 


5001 


D-003287-08 


GTATTCAGCTGATTTCTAT 


1272 


ORC6L 














ORC6L 


NM 014321 


32454755 


23594 


D-003288-05 


GAACATGGCTTCAAAGATA 


1273 • 


0RC6L 


NM 014321 


32454755 


23594 


D-003288-06 


GGACAGGGCTTATTTAATT 


1274 


ORC6L 


NM 014321 


32454755 


23594 


D-003288-07 


GAAAGAAGATAGTGGTTGA 


1275 


ORC6L 


NM 014321 


32454755 


23594 


D-003288-08 


TATCAGAGCTGTCTTAAAT 


1276 


PCNA 














PCNA 


NM 002592 


33239449 


5111 


D-003289-05 


GATCGAGGATGAAGAAGGA 


1277 


PCNA 


NM 002592 


33239449 


5111 


D-003289-07 


GCCGAGATCTCAGGCATAT 


1278 


PCNA 


NM 002592 


33239449 


5111 


D-003289-09 


GAGGCCTGCTGGGATATTA 


1279 


PCNA 


NM 002592 


33239449 


5111 


D-003289-10 


GTGGAGAACTTGGAAATGG 


1280 


PLK 














PLK 


NM 005030 


21359872 


5347 


D-003290-05 


CAACCAAAGTCGAATATGA 


1281 


PLK 


NM 005030 


21359872 


5347 


D-003290-06 


CAAGAA6AATGAATACAGT 


1282 


PLK 


NM 005030 


21359872 


5347 


D-003290-07 


GAAGATGTCCATGGAAATA 


1283 


PLK 


NM 005030 


21359872 


5347 


D-0QS290-08 


CAAGAGGGCTCATCCTCTA 


1284 


PIN1 














PIN1 


NM 006221 


5453897 


5300 


D-003291-05 


GGACCAAGGAGGAGGCCCT 


1285 


PIN1 


NM 006221 


5453897 


5300 


D-003291-06 


CGTCCTGGCGGCAGGAGAA 


1286 


PIN1 


NM 006221 


5453897 


5300 


D-003291-07 


CGGGAGAGGAGGACTTTGA 


1287 


PIN1 


NM 006221 


5453897 


5300 • 


D-003291-08 


AGTCGGGAGAGGAGGACTT 


1288 _ 


PIN1L 














PIN1L 


NM_006222 


5453899 


5301 


D-003292-06 


CGACCTGGCGGCAGGAAAT 


1289 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-07 


AG6CAGGAGAGAAGGACTT 


1290 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-08 


GCTACATCCAGAAGATCAA 


1291 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-09 


GGACAGTGTTCACGGATTC 


1292 


RAD1 














RAD1 


NM 002853 


19718797 


5810 


D-003293-05 


GAAGATGGACAAATATGTT 


1293 


RAD1 


NM 002853 


19718797 


5810 


D-003293-06 


GGAAGAGTCTGITACTTTT 


1294 


RAD1 


NM 002853 


19718797 


5810 


D-003293-07 


GATAACAGAGGCTTCCTTT 


1295 


RAD1 


NM 002853 


19718797 


5810 


D-003293-08 


GCATTAGTCCTATCTTGTA 


1296 


RAD17 














RAD17 


NM 133338 


19718783 


5884 


D-003294-05 


GAATCAAGCTTCCATATGT 


1297 


rvMj 1 f 


iNIVl lOOOOO 


19718783 


5884 


D-003294-06 


CAACAAAGCCCGAGGATAT 


1298 


RAD17 


NM 133338 


19718783 


5884 


D-003294-07 


ACACATGCCTGGAGACTTA 


1299 


RAD17 


NM 133338 


19718783 


5884 


D-003294-08 


GTACATAGATTTCTTCATG 


1300 


RAD9A 














RAD9A 


NM 004584 


19924112 


5883 


D-003295-05 


TCAGCAAACTTGAATCTTA 


1301 


RAD9A 


NM 004584 


19924112 


5883 


D-003295-06 


GACATTGACTCTTACATGA 


1302 



122 





INIVI UU4D04 


A AAO AAAI^ 

199241 12 


5883 


D-003295-08 


GGAAACCACTATAGGCAAT 


1303 


RAHQA 


INIvl UU*fDc54 


1 99241 1 2 


5883 


D-003295-09 


CGGACGACTTTGCCAATGA 


1304 


RR1 














RR1 


INIVI UUUoZl 


<l AAO AAA^ 

19924112 


5925 


D-003296-05 


GAAAGGACATGTGAAGTTA 


1305 


RR1 


rMivl uuUo^l 


19924112 


5925 


D-003296-06 


GAAGAAGTATGATGTATTG 


1306 


RR1 




A Cf\^ A*% A 

4506434 


5925 


D-003296-07 


GAAATGAGTTCTACTCGAA 


1307 


RR1 


l>Jlvl_UUUo^1 


ACt\^ A*^ A 

4506434 


5925 


D-003296-08 


GGAGGGAACATCTATATTT 


1308 
















RRRDO 


NM UUoOOD 


4826967 


5927 


D-003297-05 


CAAAGAAGCTGAATAAACT 


1309 




MM .005056 


4826967 


5927 


D-003297-^6 


CAACACATATGGCGGATTT 


1310 




iNlvi UUoOoD 


4826967 


5927 


D-003297-07 


GGACAAACCTAGAAAGAAG 


1311 


RRRDO 


KiiVA AACACC 

NM UUbOoo 


4826967 


5927 


D-003297-08 


GAAAGGCACTCTCTCTGTT 


1312 


RRI i 














rxDLI 


NM 002895 


34577078 


5933 


D-003298-05 


CAAGAGAAGTTGTGGCATA 


1313 




InM 002895 


34577078 


5933 


D-003298-06 


CAGCAGGACTCCATTTATA 


1314 


PRI i 
rxDLI 


NM 002895 


34577078 


5933 


D-003298-07 


ACAGAAAGGTCTATCATTT 


1315 


PRI i 


NM 002895 


34577078 


5933 


D-003298-08 


GGAGATAAAGTTACAATTC 


1316 


PRI O 














PRI O 


NM 005611 


21361291 


5934 


D-003299-05 


GAGCAGAGCTTAATCGAAT 


1317 


QRI O 
KdL^ 


■ NM 005611 


21361291 


5934 


D-003299-06 


GAGAATAGCCGTTGTGTGA 


1318 ' 


DQt O 

KdL2 


NM 005611 


21361291 


5934 - 


D^03299-07 


GGACTTAGTTTATGGAAAT 


1319 


DDI O 


NM_00561 1 


21361291 


5934 


D-003299-08 


GAATTTAGATGAGGGGATA 


1320 
















KbP1 


NM 002899 


8400726 


5947 


D-003300-05 


GAGACAAGCTCCAGTGTGT 


1321 


RdPI 


NM 002899 


8400726 


5947 


D-003300-06 


GCAAGCAAGTATTCAAGAA 


1322 


KBrl 


NM 002899 


8400726 


5947 


D-003300-07 


GCAGGACGGTGACCATATG 


1323 


RBPI 


NM 002899 


8400726 


5947 


D-003300-08 


GGAAGTGCATGACAACAGT 


1324 


RPA3 














RPA3 


NM 002947 


19923751 


6119 


D-003322-05 


GGAAGTGGTTGGAAGAGTA 


1325 


KPAo 


NM 002947 


19923751 


6119 


D-003322-06 


GAAGATAGCCATGCI NIG 


1326 


KPAo 


NM 002947 


19923751 


6119 


D-003322-07 


CATGCTAGCTCAATTCATC 


1327 


KPAo 


NM_002947 


19923751 


6119 


D-003322-08 


GATCTTGGACTTTACAATG 


1328 


Ol\P lA 














CI^D't A 

otvPlA 


NM 006930 


25777710 


6500 


D-003323-05 


GGAGAGATATTTGAAGTT6 


1329 


oKPIA 


NM 006930 


25777710 


6500 


D-003323-06 


GGGAATGGATGATGAAGGA 


1330 


QI^D^ A 

ofvPIA 


NM 006930 


25777710 


6500 


D-003323-07 


CAAACAATCTGTGACTATT 


1331 


CI^D<4 A 

OlVr lA 


NM 006930 


25777710 


6500 


D-O03323-08 


TCAATTAAGTTGGAGAGTT 


1332 


-OlVPZ 














Ql^DO 

olXnz 


NM 005983 


16306594 


6502 


D-003324-05 


CATCTAGACTTAAGTGATA 


1333 


ol\rz 


NM 005983 


16306594 


6502 


D-003324-06 


GAAATCAGATCTCTCTACT 


1334 


OINr'Z 


NM 005983 


16306594 


6502 


D-003324-07 


CTAAAGGTGTCTGGTGTTT 


1335 




NM 00o9o3 


16306594 


6502 


D-003324-08 


GATGGTACCCTTCAACTGT 


1336 


OiNlV 














Oi>il\ 


NM 006622 


5730054 


10769 


D-003325-05 


GAAGACATCTACAAGCTTA 


1337 


QKII^ 
OfMlV 


NM O0dd22 


5730054 


10769 


D-003325-06 


GAAATACCTTCATGAACAA 


1338 


OINIV 


NM 006622 


5730054 


10769 


D-003325-07 


GAAGGTCAATGGCTCATAT 


1339 


olNI\ 


NM_U0OO^2 


5730054 


10769 


D-003325-08 


CCGGAGATCTCGCG6ATTA 


1340 


O 1 l\1Z 














O 1 IMZ 


NM 004217 


4759177 


9212 


D-003326-07 


CAGAAGAGCTGCACATTTG 


1341 


O 1 l\ liC 


NM U04217 


47591 77 


9212 


D-003326-08 


CCAAACTGCTCAGGCATAA 


1342 


STK12 


NM 004217 


4759177 


9212 




Ar^riPrir^o Ar^TTr* Ar* A AXT/^ 
M^VjOooV^/\0 1 1 UAOMA 1 1 o 


1343 


STK12 


NM 004217 


4759177 


9212 


D-003326-10 


TGGGACACCCGACATCTTA 


1344 


TFDP1 














TFDP1 


NM 007111 


34147667 


7027 


D-003327-05 


GGAAGCAGCTCTTGCCAAA 


1345 


TFDP1 


NM 007111 


34147667 


7027 


D-003327-06 


GAGGAGACTTGAAAGAATA 


1346 


TFDP1 


NM 007111 


34147667 


7027 


D-003327-07 


GAACTTAGAGGTGGAAAGA 


1347 



123 



TFHPI 


KIM nOTiii 


0414f00/ 


7027 


D-003327-08 


GCGAGAAGGTGCAGAGGAA 


1348 


TFDP? 














TFnP9 


MM nOftOft^ 


D4041 i 1 


7AOn 

/^02y 


□-003328-05 


GAAAGTGTGTGAGAAAGTT 


1349 


TFnP9 

1 1 L/i ^ 


INIVI UUO^cOO 


04041 1 1 


7AOA 

f 029 


D-003328-06 


CACAGGACCTTCTTGGTTA 


1350 


TFDP? 


KIM nn^xOftA 

IMIVI UUOiCOO 


l^ARA4 4 4 
04041 1 1 


7AAA 

7029 


D-003328-07 


CGAAATCCCTGGTGCCAAA 


1351 


TFnP9 


INIVI UUOZou 


RARA<i 4 4 
04041 1 1 


7029 


D-003328-08 


TGAGATCCATGATGACATA 


1352 


TP'^'^ 














TP*;'^ 

1 1 v>0 


INIVI UUU040 


o400/o7 


71 57 


D-003329-05 


GAGGTTGGCTCTGACTGTA 


1353 


1 t^OO 


INIVI UUU04D 


0400737 


7157 


D-003329-06 


CAGTCTACCTCCGGCCATA 


1354 




INM UUUt)4D 


O400737 


7157 


D-003329-07 


GCACAGAGGAAGAGAATCT 


1355 


I "OO 


INIVI UUUo4D 


0400737 


7157 


D-003329-08 


GAAGAAACCACTGGATGGA 


1356 


TP*^':^ 
1 nOo 














TPft^ 


INIVI KJUO/ZZ 


31543817 


8626 


D-003330-05 


CATCATGTCTGGACTATTT 


1357 


1 r^Do 


INM UUo/ZZ 


31543817 


8626 


D-003330-06 


CAAACAAGATTGAGATTAG 


1358 


TPA*^ 
1 nOo 


NIVI UUo7/:2 


31543817 


8626 


D-003330-07 


GCACACAGACAAATGAATT 


1359 


1 nDo 


INiVI UUo/z2 


31543817 


8626 


D-003330-08 


CGACAGTCTTGTACAATTT 


1360 


Info 














1 nf o 


NM 005427 


4885644 


7161 


D-003331-05 


GCAAGCAGCCCATCAAGGA 


1361 


1 r/ o 


Klh/i AAC^O"7 

NM 005427 


4885644 


7161 


D-003331-06 


GAGACGAGGACACGTACTA 


1362 


TD7Q 


NM-U05427 


4885644 


7161 


D-003331-07 


CTGCAGAACCTGACCATTG 


1363 


TP73 


NM 005427 


4885644 


71 R1 


nAAOOOH AO 


ooOCA 1 (sUCTGl 1 lACAAG 


1364 


YWHAZ 














YWHAZ 


NM 003406 


21735623 


7534 


D-003332-05 


GCAAGGAGCTGAATTATCC 


1365 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-06 


TAAGAGATATCTGCAATGA 


1366 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-07 


GACGGAAGGTGCTGAGAAA 


1367 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-08 


AGAGCAAAGTCTTCTATTT 


1368 



Table IX 



Gene 
Name 


Accession # 


Gl# 


Duplex # 


Sequence 


SEQ. ID 
NO. 


AR 


NM 000044 


21322251 


D-003400-01 


GGAACTCGATGGTATCATT 


1369 


AR 


NM 000044 


21322251 


D-003400-02 


CA^GGAGGTTACACCAAA 


1370 


AR 


NM 000044 


21322251 


D-003400-03 


TCAAGGAACTCGATCGTAT 


1371 


AR 


NM 000044 


21322251 


D-003400r04 


GAAATGATTGCAGTATTGA 


1372 














ESR1 


NM 000125 


4503602 


D-003401-01 


GAATGT6CCTGGCTAGAGA 


1373 


ESR1 


NM 000125 


4503602 


D-003401-02 


CATGAGAGCTGCCAACCTT 


1374 




NM 000125 


4503602 


D-003401-03 


AGAGAAAGATTGGCCAGTA 


1375 


ESR1 


NM 000125 


4503602 


D-003401-04 


CAAGGAGACTCGCTAGTGT 


1376 














ESR2 


NM 001437 


10835012 


D-003402-01 


GAACATCTGCTCAACATGA 


1377 


ESR2 


NM 001437 


10835012 


D-003402-02 


GCAGGGGTCCATATACATA 


1378 


ESR2 


NM 001437 


10835012 


D-003402-03 


CAAGAAGATTCCCGGCTTT 


1379 


ESR2 


NM 001437 


10835012 


D-003402-04 


GGAAATGCGTAGAAGGAAT 


1380 














ESRRA 


NM 004451 


18860919 


D-003403-01 


GGCCTTCGCTGAGGACTTA 


1381 


ESRRA 


NM 004451 


18860919 


D-003403-02 


TGAATGCACTGGTGTCTCA 


1382 


ESRRA 


NM 004451 


18860919 


D-003403-03 


GCATTGAGCCTCTCTACAT 


1383 


ESRRA 


NM 004451 


18860919 


D-003403-04 


CGAGACAGCGGGGAAAGTG 


1384 














ESRRB 


NM 004452 


22035686 


D-003404-01 


TACCTGAGCTTACAAATTT 


1385 


ESRRB 


NM 004452 


22035686 


D-003404-02 


GCACTTGTATAGCGTCAAA 


1386 


ESRRB 


NM 004452 


22035686 


D-003404-03 


CAACTCCGATTCCATGTAG 


1387 


ESRRB 


NM 004452 


22035686 


D-003404-04 


GGACTCGCCACCCATGTTT 


1388 














ESRRG 


NM 001438 


4503604 


D-003405-01 


AAACAAAGATCGAGAGATT 


1389 



124 



ESRRG 




4'^0'^fin4 


U-UUo4UO-UZ 


TCAG G AAACTGTATG ATGA 


1390 


ESRRG 


NM 00143ft 


4'^0'^ftn4 


nno>inc oo 
U-UUo4Uo-0o 


GAAGACCAGTCCAAATTAG 


1391 


ESRRG 


NM 001 43A 


*rOuODU4 


V\ nno>iAc Ail 


ATGAAGCGCTGCAGGATTA 


1392 














HNF4A 


NM 0004'i7 


Z 1 OD 1 1 o4 


U-003406-01 


CGACATCACTGGAGCATAT 


1393 


HNF4A 


NM Q004S7 


Z 1 OD 1 1 o4 


U-UUo40b-02 


GAAGGAAGCCGTCCAGAAT 


1394 


HNF4A 


NM 0004*^7 


C, 1 OO 1 1 04 


U-003406-03 


CCAAGTACATCCCAGCTTT 


1395 


HNF4A 


NM nnn4'?7 


91 '5ft1 1fi>* 
Z loo I lo4 


lJ-003406-04 


GGACATGGCCGACTACAGT 


1396 














HNF4G 


NM 0041 '^'^ 

IN IVI L»U*f 1 OO 


DOolUo/ 


P\ AAO>l/^'^ 

lJ-003407-01 


GCACTGACATAAACGTTAA 


1397 


HNF4G 


NM 0041 '^'^ 


OOOlUc)/ 


L)-003407-02 


ACAAAGAGATCCATGATGT 


1398 


HNF4G 


MM 0041'^'^ 

INIVI UU*f 1 OO 


DOol Uo/ 


D-003407-03 


AGAGATGCATGATGTATAA 


1399 


HNF4G 


MM 0041 '^'^ 
iNIVl \J\JH I OO 


DDolUo/ 


D-003407-04 


AAATGAACGTGACAGAATA 


1400 
















MM 017'^'^'5 
INIVI U 1 / ooz 


0001770 


D-003408-01 


GAATGAATCTACACCTTTG 


1401 




MM ni7'N'^0 
INIVI U 1 / DOZ 


oyzo/^ /O 


D-003408-02 


GGAAATACGTGGAGACACT 


1402 




MM niT'^'^O 
INIVI U 1 / OOZ 


oy^io/ /o 


U-003408-03 


CCAGATAACTACGGCGATA 


1403 




MM ni7C\'^0 


oy^o/ /o 


D-003408-04 


TGGCGTACCTTCTCATTGA 


1404 














NR0B1 


MM nnnA7f; 


oUlouoy 


D-003409-01 


CAGCATGGATGATATGATG 


1405 


NRORI 


MM nOOyf 
INIVI UUU4 / O 


5016089 


D-003409-02 


CTGCTGAGATTCATCAATG 


1406 


NRORI 


MM OOn^TK 
INIVI UUU4fO 


501 6089 


D-003409-03 


ACAGATTCATCGAACTTAA 


1407 


NRORi 


MM nno>i 7c 


5016089 


D-003409-04 


GAACGTGGCGCTCCTGTAC 


1408 
















MM nO-IQGfi 


1 O259502 


D-003410-01 


GAATATGCCTGCCTGAAAG 


1409 


NR0R9 


INIVI uziyoy 


1 O259502 


D-003410-02 


GGAATATGCCTGCCTGAAA 


1410 




IN IVI Uiciyoy 


1 3259502 


D-003410-03 


CGTAGCCGCTGCCTATGTA 


1411 


NRnR9 


INIVI uzi yoy 


1 3259502 


D-0034 10-04 


GCCATTCTCTACGCACTTC 


1412 
















INIVI UZ 1 f Z4 


1 O430847 


D-003411-01 


CAACACAGGTGGCGTCATCTT 


1413 




MR/I nOiTOil 
INIVI UZ 1 / Z4 


13430847 


D-003411-02 


GGCATGGTGTTACTGTGTATT 


1414 


NRini 


MR/I no-iTOvi 

INM UiC 1 / 4^4 


13430847 


D-0034 11-03 


CAAGATGCATTCCGAGAAGTT 


1415 


NRini 


MM noi 70/1 

INIVI UZ1 / Z4 


1 3430847 


D-003411-04 


GCGC l tfrGCTTCGTTGTTCTT 


1416 














NR1H2 


NM 007121 


11321629 


D-003412-01 


GAACAGATCCGGAAGAAGA 


1417 


NR1H2 


NM 007121 


11321629 


D-003412-02 


GAAGAACAGATCCGGAAGA 


1418 


NR1H7 


MM nriT'i oi 

INIVI UUfJZl 


11321629 


D-003412-03 


CTAA6CAAGTGCCTGGTTT 


1419 


NR1 H2 


MM nn7i oi 

INIVI UU f I £\ 


11321629 


D-003412-04 


GGTAACAGCGGCTCAAGAA 


1420 














NRIH"^ 
NRIH") 


MM nn'^RQ') 
iNivi uuooyo 

MM rinc;f%Q'^ 
INIVI vjuooyo 


OUo io9^ 
oUol o92 


D-003413-01 
D-0034 13-02 


GAACAGATCCGGCTGAAGA 


1421 


NR1H3 


MM nn<%RQ'% 
INIVI v/uooyo 


oUo1 


u-003413-03 


GGAGATAGTTGACTTTGCT 
GAG 1 rTGCGTTGCTCATTG 


1422 
1423 


NR1 H3 


MM nn^fto 
IN ivi__uu ooyo 


oOol o92 


D-003413-04 


TGACl 1 IGCTAAACAGCTA 


1424 














NR1 H4 


MM nO'^iO') 
INIVI UUuIZO 


y|QOi5n7rk 
40^09^9 


D-0034 14-01 


CAAGTGACCTCGACAACAA 


1425 


NR1 H4 


MM nO'xIO'^ 

INIVI UUO 1 ZO 


4ozoy f 9 


D-0034 14-02 


GAAAGAATTCGAAATAGTG 


1426 


NR1 H4 


MM 00*^1 9*^ 

INIVI Uv\J 1 ZO 


4ozoy/ y 


D-00341 4-03 


CAACAGACTCTTCTACATT 


1427 


NR1 H4 


MM nO'nlO'^ 
INIVI WUO 1 C.O 


4oZDy/y 


D-0034 14-04 


GAACCATACTCGCAATACA 


1428 














NR1I2 


MM OO'^ftftQ 


nooolo3 


D-003415-01 


GAACCATGGTGACTTTGTA 


1429 


NR1I2 


NM 003889 


11863133 


D-0034 15-02 


GATGGACGCTCA(^ATr;AAA 


i40U 


NR1I2 


NM 003889 


11863133 


D-003415-03 


CAACCTACATGTTCAAAGG 


1431 


NR1I2 


NM 003889 


11863133 


D-003415-04 


CAGGAGCAATTCGCCATTA 


1432 














NR1 13 


NM 006122 


4826660 


D-003416-01 


GGAAATCTGTCAGATCGTA 


1433 


NR1I3 


NM 006122 


4826660 


D-003416-02 


TCGCAGACATCAACACTTT 


1434 



NR1I3 


NM 005122 


48266f)n 


L/-UU04 1 D-UO 


UU 1 u 1 1 U(3(J TACACAATTG 


1435 


NR1I3 


NM 005122 


4826660 


U-UU04 1 0-U4 


UAAuACj I 1 1 GTGCAGTTTA 


1436 














NR2C1 


NM 003297 


4507679 


1^-00^^41 7 ni 


1 oAOAvjOACTTGATCATAA 


1437 


NR2C1 


NM 003297 


4507679 


n«00'^4i7 no 


CjCjAAGGAAGTGTACACCTA 


1438 


NR2C1 


NM 003297 


4507679 


n_00'^4 1 7 HQ 


OiAGCAGATCTTCAAACTAC 


1439 


NR2C1 


NM 003297 


4507fi79 


n_00'^417 A/f 


IjAACjAAA ITGCACATCAAA 


1440 














NR2C2 


NM 00'^2Qft 




iJ-UU>541o-01 


GAACAACGGTGACACTTCA 


1441 


NR2C2 


NM 0032Q8 




U-UU041 t5-0Z 


OTG ATG AG CTCC AACATAA 


1442 


NR2C2 


NM 0032Q8 


4.^07R7A 




CAACCTAAGTGAATCTTTG 


1443 


NR2C2 


NM 003298 


4'?07ft74 


ri-AnOyi 1 Q n>i 
U-UUo41 0-04 


GAAGACACCTACCGATTGG 


1444 














NR2E1 




91 '^Rl 1 0ft 


r\_nnovi4n rk-i 


GATCATATCTGAAATACAG 


1445 


NR2E1 




4C. 1 OO f f UO 


U-UUo4 19-02 


CAAGACTGCTTTCAGATAT 


1446 


NR2E1 




91 '^Rl 1 0ft 


iJ-UUo41y-0o 


GTTAGATGCTACTGAATTT 


1447 


NR2E1 




Z i oOl lUo 


U-UUo4 19-04 


CAATGTATCTCTATGAAGT 


1448 














NR2E3 


NM 01494Q 


/ DO / 


U-UUo4zO-01 


GAGAAGCTCCTTTGTGATA 


1449 


NR2E3 




7R'\71Qy1 


IJ-00o420-02 


GAAGCACTATGGCATCTAT " 


1450 


NR2E3 


NM ni494Q 


/ OO l ^ifH 


r\ A AO yi or> r\ o 

U-00o420-03 


GAAGGATCCTGAGCACGTA 


1451 


NR2E3 


NM nid94Q 


f DO / oy*!- 


U-00o420-04 


GAAGCTCCTTTGTGATATG 


1452 














NR2F1 


NM nnsR^^ 




u-0uo421-01 


GAAAGTCTCATCCGCGATA 


1453 


NR2F1 


NM nrmRc^ 




D-003421-02 


TCTCATCCGGGATATGTTA 


1454 


NR2F1 


NM nnRR'>4 


\ £. ( 4o4 


lJ-003421-03 


CAAGAAGTGCCTCAAAGTG 


1455 


NR2F1 


NM nn^f^RA 




D-003421-04 


GGAACTTAACTTACACATG 


1456 














NR2F2 


NM 091 nn«^ 


14 l4y/40 


U-003422-01 


GTACCTGTCCGGATATATT 


1457 


NR2F2 


NM 091 nn«^ 


1 4 I4y /40 


U-003422-02 


CCAACGAGCCGACGAGATT 


1458 


NR2F2 


NM 091 00^ 


1 4 1 4y /40 


Li-^03422-03 


AGTCGTACGTGTCCGGATA 


1459 


NR2F2 


NM 091 00(^ 


14 l4y/40 


U-0U3422-04 


GGCCGTATATGGCAATTCA 


1460 














NR2F6 


NM 00S9'^4 


900701 QR 


U-UUo42o-01 


CGACGCCTGTGGCCTCTCA 


1461 


NR2F6 


NM 00*>9'^4 


9nn7niQft 


r\ oooittoo OO 

U-u0o423-02 


CAGCCGGTGTCCGAACTGA 


1462 


NR2F6 


NM 00*^9"^ 


zuu^uiyo 


AAOvlOO OO 

iJ-00o423-03 


CAACCGTGACTGCCAGATC 


1463 


NR2F6 


NM 0059'^d 


900701011 


U-UUo4Zo-04 


GTACTGCGGTGTCAAGAAG 


1464 














NR3C1 


NM 0001 7R 


*fOU4 1 OZ 


Li-U0o424-01 


GAGGACAGATGTACGACTA 


1465 


NR3C1 


NM 0001 7R 


*TOU*t 1 


r\ nnoiiovi oo 
U-UUo424-02 


GATAAGACCATGAGTATTG 


1466 


NR3C1 


NM 0001 7R 




IJ-UUo4Z4-0o 


GAAGACGATTCATTCCTTT 


1467 


NR3C1 


NM 000178 




r4_nnQ>ioyi nvi 
IJ-UU04Z4-U4 


GGAGAGATGTACGACTATG 


1468 














NR3C2 


NM 000901 


*f ouo 1 yo 


L;-UUo4iiO-U 1 


GCAAACAGATGATGGAAGT 


1469 


NR3C2 


NM 000901 


*tuuo 1 yo 


nnQyioc no 
U-uUo4Zo-UZ 


GAGGTAAGATTTATGAGAA 


1470 


NR3C2 


NM 000901 


4*^0^1 Oft 


r\ nnQ/ioc no 
U-UUo4ZO-Uo 


^^^^ A ^^^^ AAA ^^^r*.^^ AAA m M. — - 

GCAGGAAAGTGAAAGAAGT 


1471 


NR3C2 


NM 000901 


4<=^0^1Qft 


"^rVAOylOC AyJ 


GGTATCCGGTGTTAGAATA 


1472 














NR4A1 


NM 0021*^*5 


it 1 OO 1 OH 1 


n nno/iofj n-4 

U-UUo4ZO-U 1 


GAAGGAAGTTGTGGGAAGA 


1473 


NR4A1 


MM 0021 ?S 


91*^81 '^41 

^ 1 OO I 0*f 1 


r^_Ano>ioc no 


CAGGAGAGTTTGAGACCTT 


1474 


NR4A1 


NM 002135 


21361341 


D-003426-03 


GAGTGGCTGTGAGTACTAT 


l*f /O 


NR4A1 


NM 002135 


21361341 


D-003426-04 


GAAGGCCGCTGTGCTGTGT 


1476 














NR4A2 


NM 006186 


5453821 


D-003427-01 


GCAATGCGTTCGTGGCTTT 


1477 


NR4A2 


NM 006186 


5453821 


D-003427-02 


CGGCTACACAGGAGAGTTT 


1478 


NR4A2 


NM 006186 


5453821 


D-003427-03 


CCACGTGACnrCAACAAT 


1479 



126 



NR4A2 


MM nOftlAR 

INI VI vV/QlOO 


uHDOOZ 1 


n An0/I07 Ay! 

L*-UUo4Z / -U4 


GAATACAGCTCCGATTTCT 


1480 














NR4A3 


MM nORQAI 


1 1 97An7n 


n AAO/IOD (\A 


^ AAA A A A A A ^\ A 111 1 « 

CAAAGAAGATCAGACATTA 


1481 


NR4A3 


MM OHRQAI 

INIVI UUO^O 1 


1 IZ/OU/U 


AAO>IOO AO 


GATCAGACATTACTTATTG 


1482 


INI \*TrAO 


MM nnf^Qfti 

INIVI UUD\70 1 


1 lil/OU/U 


r\ AAO>ioo AO 


CCAGAGATCTTGATTATTC 


1483 


NR4A3 


MM nnRQfti 

l>IIVI_ 1 


1 1 07Rf\7f\ 
1 1 Z/OUr U 


n AAOyiOQ A>l 

L/-UUo42o-04 


GAAGTTGTGCGTACAGATA 


1484 














NRSA1 




9nn7ni qo 


AAOiiOA f\A 

Li-003429-01 


GATTTGAAGTTCCTGAATA 


1485 




MM nn/ioc;Q 
iMivi vu*fyoy 


0An7A'1OO 


AAOvlOA rko 

U-003429-02 


GGAGCGAGCTGCTGGTGTT 


1486 




MM nn/lQi^Q 


OflAVA-l OO 

zuu/uiyz 


r\ AAO>IOA AO 

U-0U3429-03 


GGAGGTGGCCGAGGAGATG 


1487 


NRSA1 


iNivi uu^yoy 


OAA7A'1 AO 


A AO A OA f\ A 

U-003429-04 


CAACGTGCCTGAGCTCATC 


1488 
















INlVl UUOOZZ 


ZUU/Ulbl 


D-003430-01 


CCAAACATATGGCCACTTT 


1489 








D-003430-02 


TCAGAGAACTTAAGGTTGA 


1490 




MM nn'ifloo 


OAA"7A'1 a A 


D-003430-03 


GGATCCATCTTCCTGGTTA 


1491 


NRSA9 
1 N f \\jr\^ 


MM nn*^AOo 


OAATAi CI 
zUU/U ID I 


p\ OAO A OA /\ A 

D-U03430-04 


AAGAATACCTCTACTACAA 


1492 














INrxUM 1 


MKA 


1o4o1o47 


D-003431 -01 


CAACGAACCTGTCTCATTT 


1493 




M^/l HQ'^Q'iyl 


lo4blo47 


/\t\^A^A 

D-003431-02 


GAAGAACTACACAGATTTA. 


1494 


NRRAi 


INM Uoaoo4 


TJ5451 o47 


D-003431-03 


GAAGATGGATACGCTGTGA 


1495 


INr\OM J 


INM Uoooo4^ 


15451847 


D-003431-G4 


AAACGATACTGGTACATTT 


1496 














ntill 
1 lUll 


u loolo 


z11do71 


AAA ^ A^ 

D-003432-01 


GAAGAATGATCGAATAGAT 


1497 


1 lull 


UlDOlO 


21 16671 


D-003432-02 


GAACATGGAGCAATATAAT 


1498 


null 

nun 


L/lDolO 


2116671 


D-003432-03 


GAGGAGCTCTTGGCCTTTA 


1499 


ntill 
1 lull 


LIlDo i5 


2116671 


D-003432-04 


TAAACAACATGCACTCTGA 


1500 
















NM k)\ju\jZo 


4505766 


D-003433-01 


GAGATGAGGTCAAGCTACA 


1501 




iNivi UUUy^D 


4505766 


D-003433-02 


CAGCGTTTGTATGAACTTA 


1502 




INM UUU9ilD 


4505766 


D-003433-03 


AGATAACTCTCATTCAGTA 


1503 


pr;R 


INM UUUyiiO 


4505766 


AAA 4 AA A ^ 

D-003433-04 


GTAGTCAAGTGGTCTAAAT 


1504 














PPARA 


INM UUoUoO 


7549810 


D-003434-01 


TCAGGGAGCTQAGGGAATT 


1505 


PPARA 


INM UU5UOO 


754981 0 


D-003434-02 


GAACATGACATA6AAGATT 


1506 


PPARA 


KlttA AAf^AOA 
INIVI UUOUOO 


7549810 


D-0034 34-03 


GGATAGTTCTGGAAGCTTT 


1507 


PPARA 


KliVil AACAOA 

INM UUDUuO 


7549810 


D-003434-04 


GACTCAAGCTGGTGTATGA 


1508 














PPAPn 

1 1 Mr\L/ 


rMIVI>„.UUD^OO 


o45^3^39 


D-003435-01 


GAGCGCAGCTGCAAGATTC 


1509 


PPARn 


INM UUDZOO 


CA COAOA 

o45o939 


D-003435-02 


GCATGAAGGTGGAGTACGA 


1510 


PPARn 


INIVI UUD^OO 


CZA CO AO A 


D-003435-03 


GGAAGCAGTTGGTGAATGG 


1511 


PPARn 


INM UUOiCOO 


C/ICOAOA 

o4oo939 


A A A ^ A C A A 

l>003435-04 


GCTGCAAGATTCAGAAGAA 


1512 














ppARf; 


INM loo/^ iZ 


OAOOCOO A 


r\ AOOjIOA a /i 

D-003436-01 


AGACTCAGCTCTACAATAA 


1513 


PPARCn 


INM loorlZ 


OAOOOOO >l 


A/^OilOA A A 

D-003436-02 


GATTGAAGCTTATGTATGA 


1514 




INM 1 OOf IZ 


OAOOCOO/I 


AAA if A A A A 

D-003436-03 


AAGTAAGTGTCCTCAAATA 


1515 


PPARn 


MM I'^AT'IO 
INm 1 OO r T Z 


ZUoooZo4 


Pk AAO A OO A A 

U-003436-04 


GCATTTCTACTCCACATTA 


1516 














RARA 


INIVI uuuyo^ 


40Uo4jo 


AAO A A A 

D-003437-01 


GACAAGAACTGCATCATCA 


1517 


RARA 

1 AAA 1 VTA 


MM nc\nQf\A 
iNivi uuuyo^ 


40Ud4 To 


P\ A A A A A^ A A 

D-003437-02 


GCAAATAGACTAGGAACAA 


1518 


RARA 


MM nnnofiA 
INM uuuyD4 


4oUd41 o 


A A O A A ^ A A 

lJ-003437-03 


GAAGAAGAGCTCAGAACAA 


1519 


RARA 


NM 000964 


4506418 


D-003437-04 


G AGC AGC AGTTPTr; A An A r; 


i con 














RARB 


NM 000965 


14916493 


D-003438-01 


GCACACTGCTCAATCAATT 


1521 


RARB 


NM 000965 


14916493 


D-003438-02 


GCAGAAGTATTCAGAAGAA 


1522 


RARB 


NM 000965 


14916493 


D-003438-03 


GGAATGACAGGAACAAGAA 


1523 


RARB 


NM 000965 


14916493 


D-003438-04 


GCACAGTCCTAGCATCTCA 


1524 
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KIM nnnoRA 


Z1oo9o0l 


U-0034 39-01 


GAAATGACCGGAACAAGAA 


1525 


rvMrvVj 


INIVI UUU^iiOO 


ill ooyooi 


D~0034 39-02 


TAGAAGAGCTCATCACCAA 


1526 


rvArvva 


MM nnnciAfi 
INiVl uuuyoD 


^1ob9o51 


D-003439-03 


CAAGGAAGCTGTGCGAAAT 


1527 




MM nnnof^^^ 




r\ OOO A OA Ail 

U-003439-04 


TGAGTGAGCTGGCTACCAA 


1528 














RPlRA 


Nm 1 o4^0l 


19743902 


r> AAO J il /\ A 

D-003440-01 


GGAAAGAGTTTATGTTCTA 


1529 


rsvJKA 


NM 1 o4Zd1 


19743902 


D-003440-02 


CAAGATCTGTGGAGACAAA 


1530 


DORA 


Nivl lo4i:Dl 


1 9743902 


D-003440-03 


GCACCTGACTGAAGATGAA 


1531 


RPiRA 


iNlVI 1o4201 


19743902 


D*003440-04 


CCGAGAAGATG6AATACTA 


1532 














KUKd 


IMM UUoyi4 


19743906 


D-003441-01 


GCACAGAACATCATTAAGT 


1533 


DPkDQ 

KUKd 


NM O0d914 


19743906 


D-003441-02 


CCACACCTATGAAGAAATT 


1534 


DODO 

KUKd 


NM 00o914 


19743906 


D-003441-03 


GATCAAATTCTACTTCTGA 


1535 


KUKd 


NM 006914 


1 9743906 


D-003441-04 


TCAAACAGATAAAGCAAGA 


1536 














KUKu 


NM 005060 


19743908 


D-003442-01 


TAGAACAGCTGGAGTACAA 


1537 


KUKO 


NM 005060 


19743908 


D~003442-02 


TCACCGAGGCCATTCAGTA 


1538 


KUKO 


NM 005060 


19743908 


D-003442-03 


GAACAGCTGCAGTACAATC 


1539 


RORC 


NM 005060- 


19743908 


D-003442-04 


CCTCAT6GCACCTTGAATA 


1540 














DVD A 


NM 002957 


21536318 


D-003443-01 


TGACGGAGCTTGTGTCCAA 


1541 


DVD A 

RXRA 


NM 002957 


21536318 


D-003443-02 


CAACAAGGACTGGCTGATT 


1542 


DVD A 

RXRA 


NM 002957 


21536318 


D-003443-03 


GCAAGGACCTGACCTACAC 


1543 


DVD A 

r<XRA 


NM 002957 


21536318 


D-003443-04 


GCAAGGACCGGAACGAGAA 


1544 














RXRB 


NM 021976 


21687229 


D-003444-01 


GCAAAGACCTTACATACTC 


1545 


r<XRB 


NM 021976 


21687229 


D-003444-02 


GCAATCATTCTGTTTAATC 


1546 


RXRB 


NM 021976 


21687229 


D-003444-03 


TCACACCGATCCATTGATG 


1547 


r<ARD 


NM 021976 


21687229 


D-003444-04 


GCAAACGGCTATGTGCAAT 


1548 














DVD/^ 

RXRvj 


NM 006917 


21361386 


D-003445-01 


GGAAGGACCTCATCTACAC 


1549 


D VD/^ 


NM 006917 


21361386 


D-003445-02 


CCGGATCTCTGGTPAAACA 


1550 


RXRC3 


NM 006917 


21361386 


D-003445-03 


GGGAGCCATTGTACTCTTT 


1551 


DVD/^ 

RXRC3 


NM 006917 


21361386 


D-003445-04 


GAGCCATTGTACTCTTTAA 


1552 














TLID A 


NM 003250 


20127451 


D-003446-01 


GGACAAAGACGAGCAGTGT 


1553 


I liRA 


NM 0.Q3250 


20127451 


D-003446-02 


GGAAACAGAGGCGGAAATT 


1554 


TUID A 

1 MKA 


NM 003250 


20127451 


D-003446-03 


GTAAGCTGATTGAGCAGAA 


1555 


1 HRA 


NM 003250 


20127451 


D-003446-04 


GAACCTCCATCCCACCTAT 


1556 














1 nKb 


NM 000461 


10835122 


D-003447-01 


GAATGTCGCTTTAAGAAAT 


1557 


XLJDD 

1 MRd 


NM 000461 


10835122 


D-003447-02 


GAACAGTCGTCGCCACATC 


1558 


TWJPR 

1 nKD 


NM 000461 


10835122 


D-003447-03 


GGACAAGCACCAATAGTCA 


1559 


THRB 


NM 000461 




Ly-UUO*r*f f -U*f 


o 1 ooAAAVjVj I 1 VaAV^ 1 1 VavsA 


4 COA 














VDR 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


VDR 


NM 000376 


4507882 


D-003448-02 


GCAACCAAGACTACAAGTA 


1562 


VDR 


NM_000376 


4507882 


D-003448-03 


TCAATGCTATGACCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 
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GACCAUAAAUGUAAGGUUU 


1565 1 


UAGAAGAUCUGAUGUCAAA 


1566 


GAAAUGUUCACUUCAGUUA 


1567 


GAAGAUCGCUACUGAAGCA 


1568 







GGAAGCAACUGCAGAGACA 


1569 


GAUGACACCUCUGAAGAAA 


1570 


UAAAGUUGCUCAUCAAGUU 


1571 


CAACGAGUCUGGGGAAGGA 


1572 



ABCG2 



KCNH1 



CLCA1 







GCAGAUGCCUUCUUCGUUA ' 


1573 


AGGCAAAUCUUCGUUAUUA 


1574 


GGGAAGAAAUCUGGUCUAA 


1575 


UGAGUGAUCCCAACAUUUA 


1576 






CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGCGACACUU 


1578 


GAUCAUAGCACCUAAGAUA 


1579 


GCUAUUUACUGCUCUUAUU 


1580 


UCACUGGGCUCCUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUAGUGUA 


1583 


UGACGGCGCUCUACUUCAC 


1584 








GAGAUGAAUUCCUUUGAAA 


1585 


GAAGAAGGCAUGAAACGAA 


1586 


GAUAAAGACAGGAUUGAAA 


1587 


GCUGAGAGGUGUAUUUAAA 


1588 








GAACAACAAUGGCUAUGAA 


1589 


GUACAUAQC.UGGCUGGAUU 


1590 


GAACAGCOCACAAGUAUAU 


1591 


GGAAAGGUGUGUGUAUAUU 


1592 



SLC6A1 



SLC6A2 



GGAGGUGGGAGGAGAGUUA 


1593 


UCACAGCCCUGGUGGAUGA 


1594 


GAAGGUGGCUGCUAUGUUC 


1595 


GGUCAAGAGUACCAACAUG 


1596 








GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGGUGGGGUAGUG 


1598 


GGGAAACUGUUCAGAUUUG 


1599 


GAAGAAAUUUGACAACAAG 


1600 



SLC21A2 








GUACAUCUCCAUCUUAUUU 


1601 




GGAAGUGGGUGAGUUAUUA 


1602 



SLC21A3 



SLC28A1 



SLC29A1 



SLC26A1 



SLC26A2 



SLC4A4 



GLRA1 



KLK1 



GAAGGGAGGCUCAAUGUAA 


1603 


GAAGGAAGUGGCUGAGUUA 


1604 , 






GUAGAAACAGGAGCUAUUA 


1605 


CAAGAUUACUGUCAAACAA 


1606 


GCACAAGAGUAUUU6GUAA 


1607 


GCAAAUGUCGCUUCU6UAU 


1608 


GGAUGACUCGUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA 


1610 


GAAAAUGCCUCUUCAGGAA 


1611 








GUUCAUCGGUCUCCUCUUU 


1612 


GGAUCAAGCUGUUUCUGAA 


1613 


GGACUGCAGUUUGUACUUG 


1614 


GAGUGAAACUGACCUAUGG 


1615 








GAACGCUGCUCCCGUGGAA 


1616 


GAAAGCCACUCUAUCAAAG 


1617 1 


GAAACCAGGUGCCUUOAGA 


1618 


CCUCACAGCUGUAUUCAUG 


1619 






CCACGGAGCUGCUGGUCAU 


1620 


GGGUUGACAUCUUAUUUGA 


1621 


GCACGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUCAUUG 


1623 


CAACACCGAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA 


1627 






CCAAAGAACUCAAUGAACA 


1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 


GUAUUGAACUGOACUGUAA 


1631 








GGAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


1633 


GGACAAAGCCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 








UGAAAGCCAUUGACAUUUG 


^ 1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGAGUA 


1639 








UCAGAGUGCUGUCUUAUGU 


1640 


CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGCUGAUAC 


1642 



! A GGCGGCUCUGUACCAUUU 



1643 



ADAM2 







GAAACAUGCUGUGAUAUUG 


^^1^41 


GCAGAUGUUUCCUUAUAUA 


1645 


CAACAGAGAUGCCAUGAUA 


1646 


GAAAGGCGCUACAUUGAGA 


1647 



XPNPE 






PI 








GACCUGAGCUUCCCAACAA 


1648 




GCGACUGGCUCAACAAU UA 


1649 1 




GAGAUUGCGUGGCUAUUUA 


1650 




GACAGCAACUGGACACUUA 


1651 



GZMA 







GGAAGAGACUCGUGCAAUG 


1 1652 


GGAACCAUGUGCCAAGUUG 


! 1653 


GAAGUAACUCCUCAUUCAA 


1 1654 


GAACUCCUAUAGAUUUCUG " 


i 1655 



CMKLR1 



GLN3 



CALCR 



OXTR 



EDG4 



EDG5 



CAUAGAAGCUUUACCAAGA 


1656 


GAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGAGUUCUAUGAAUG 


1659 







CAUCAUGCGUUGUGAAUAA 


1660 


CAACAGCUCAUCACGAUUU 


1661 


GGAACAACUUCUCUUAUGU 


1662 


GGUGUUCGCUAGCAUCUCA 


1663 








GGACCUAGCUGUUGUAAAG i 


1664 


GAAAGACCAUGCAUUUAAA 


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


GAAUAAACCAGUAt>€GUUA 


1667 








GGACCCAGAUAUCCAAAUA 


1668 


GCAAUACUAUCCUAACUGA 


1669 


GAAUAUAGAUUAGCGUUUG 


1670 


GAU6AGGCAUGACUACUAA 


1671 








GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGCCACCCACUGA 


1673 


GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC 


1675 








UCCAGG AACACUAUAAU UA 


i 1676 


GUGACCAUCUUGUCCAUCA 


1677 


CAUCCUCUGUUGGGGCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 
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EDG7 



PTCH 



SMO 



CASP3 



CASP6 



CASP7 



CASP8 



CASP9 



ACACUGAUACUGUCGAUGA 


1680 


AAUAGGAGCAACACUGAUA 


1681 


CAGGAGGAGUUACGUUGUU 


1682 


GGACACCCAUGAAGGUAAU 


1683 






GCACAGAACUCGAGUCAAA 


1684 


GGAGAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 


GGACAAACUUCGAGCCUUU 


1687 






UCGGUACCCUGCUGUUAUU 


1688 


GCUACAAGAAGUACGGAUA 


1689 


CAAGAAAGGUUGGUUGAAC 


1690 


GAGAAGAAAUAGAGUCAAU 


1691 








CAAUAUAUCUGAAGAGCUA 


1692 


GAACUGGACUGUGGGAUUG 


1693 


C3UGAGAAGAUGGUAUAUUU 


1694 


GAGGGUACUUUAAGAGAUA 


1695 





.CAUGAGGUGUCAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG 


1 1697 


AAAUAUGGCUGCUCGUUAG 


1 1698 


GCAAUCACAUUUAUGCAUA 


1 1699 


CAACAUAACUGAGGUGGAU 


j 1700 


GAUGGUACAUUGAAGAUUU 


i 1701 







C3AACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


CAACAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUeACUGA 


1705 1 








GAAGUGAACUAUGAAGUAA 


1706 


CAACAAGGAUGACAAGAAA 


. 1707 


GGACAAAGUUUAGCAAAUG 


1708 


GAGGGUCGAUCAUCUAUUA 


1709 


GAAUAUAGAGGGCUUAUGA 


1710 


CAACGACUAUGAAGAAUUC 


1711 


GAAGUGAGCAGAUCAGAAU 


1712 


GAGGAAAUCUGCAAAUGGA 


1713 








CCAC3GCAGCUGAUCAUAGA 


1714 


UCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


1716 


GCACUGGUCUGUAGGGAUU 


1717 



DVL2 



UCGUAAAGCUGUUGAUAUC 


1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUGGA 


1720 


GAUGGUAAAGCUGUUGAUA 


1721 




AGACGAAGGUGAUUUACCA 


1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA 


1724 


UAAUAGGCAUUUCCUCUUU 


1725 



PTEN 



GUGAAGAUCUUGACCAAUG 


1726 


GAUCAGGAUACACAAAUUA 


1727 


GAAUGAACCUUCUGCAAGA 


1728 


GGCGCUAUGUGUAUUAUUA 


1729 







GUACAAAGCUGGUAUAUGG 


1730 


GAAAGACUCCCAGUGUAUA 


1731 


GGAAGUCCAUCUCAUCGAA 


1732 


CCAAAGACAUGACGACGUU 


1733 



PDK2 







GUAAAGAGGAGACUGAAUG 


1734 


6GUCUGUGAUGGUCCCUAA 


1735 


CAAAGAUGCCUACGACAUG 


1736 


GGGCGAUGCCUGAGGGUUA 


1737 



PPP2CA 







UCACACAAGUUUAUGGUUU ' 


' 1738 


CAACAGCGGUGACCACUUU 


1739 


UAACCAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA 


1741 



CTNNA1 







GAAGAGAGGUCGUUCUAAG 


1742 


AAGCAGAUQUGCAUGAUUA 


1743 


UCUAAUAACUGCAGUGUUU 


1744 


GUAAAGGGCGCUCUAAUAA 


1745 









GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCAGAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCCCAUGUACUA 


1749 



HSPCA 



GGGAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUACUGA 


1751 


UAUAAGAGCUUGACCAAUG 


1752 


GCAGAUAUCUCUAUGAUUG 


1753 



DCTN2 



CAACUCAUGUCCAAUACUG 
GGAAUGAGCCAGAUGUUUA 



1754 
1755 



GGAGACAGCUGUACGUU6U 


1756 


UCCAAGAGCUGACAACUGA 


1757 



CD2 









GUAAGGAGAAGCAAUAUAA 


1758 


AAGAUGAGCUUUCCAUGUA 


1759 


GGACAUCUAUCUCAUCAUU 


1760 


GACAAGAGCCCACAGAGUA 


1761 



BAD 







GUACUUCCCUCAGGCCUAU 


1762 


GCUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC 


1765 



SMAC 







CAGCGUAACUUCAUUCUUC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA. "1 


1769 


UAGAAGAGCUCCGUCAGAA 


1770 


CCACAUAUGCGUUGAUUGA 


1771 


GGGCAGGGCUCUCUACCUA 


1772 



MAP3K5 



PVR 



ERBB2 



SOS1 



BRCA1 





GAACAGCCUUCAAAUCAAA j 


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


CAGGAAAGCUCGUAAUUUA 


1776 








CCACACGGCUGACCUCAUA 


1777 


CAGCAGAAUUCCUCUUAUA 


1 1778 


GCAGAAUUCCUCUUAUAAA 


1779 


GAUCGGGAUUUAUUUCUAU 


1780 








UGUGGGAGCUG^UGACUUU 


1781 


UCACAGAGAUCOUGAAAGG 


1782 


UGGAAGAGAUCACAGGUUA 


1783 


GCUCAUCGCUCACAACCAA 


1784 






GAGCACCACUUCUAUGAUU 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


UGAAAGCCCUCGCUUAUUA 


1787 


GAAAUAGCAUGGAGAAGGA 


1788 








CCAUACAGCUUCAUAAAUA 


1789 


gaagagaacuuaucuagug 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 


gcaagaugcugauucauua 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 
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CDKN1A 



CDKN1B 



GAACAAGGAGUCAGACAUU 


1796 


AAACUAGGCGGUUGAAUGA 


1797 


GAUGGAACUUGGACUUUGU 


1798 


GUAAACAGAUGGCACUUUG 


1799 








GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU 


1801 


GAAUGGACAUCCUGUAUAA 


1802 ~' 


GUAAACAGCUCGAAUUAAG 


1803 



SLC2A4 











CAGAUAGGCUCCGAAGAUG 


1804 


AGACUCAGCUCCAGAAUAC 


1805 


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC 


1807 



NOS2A 



FRAP1 



FKBP1A 



TNFRSF 
1A 







CCAGAUAAGUGACAUAAGU 


1808 ' 


UAAGUGACGUGCUUUGUAA 


1809 


GAAGAGAGAUUCGAUUGAA 


1810 


UGAAAGAGCUCAACAACAA 


1811 








GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


UAAGAAAGCUAUCCAGAUU 


1815 






GAAACAAGGCGUUUAAGUU 


1816 


GAAUUACUCUCCAAGUUGA 


1817 


CAGGACAAGUGGUAGGUUA 


1818 


GUUGAGGAGUGAAUUACUG 


1819 


GAUGGCAGGUGUUUAAAUG 


1820 


GAGUAUCCUUUCAGLKiUUA 


1821 






CAAAGGAACCUACUUGUAC 


1822 


GGAACCUACUUGUACAAUG 


1823 


GAACCUACUUGUAGAAUGA 


1824 


GAGUGUGUCUCCUGUAGUA 


1825 









GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


' 1829 



IRAKI 





CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCCCAUGUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 



TRAF2 



GAAGACAGAGUUAUUAAAC 


1834 


UCACGAAGACAGAGUUAUU 


1835 ' 


lAGACAGAGUUAUUAAAGCA 


1836 


GACGAAGACAGAGUUAUUA 


1837 


GCUGAAGCCUGUCUGAUGU 


1838 



CAAAUGAUCUGAGGCAGUU 


1 1839 


GUUCAUAGUUUGAGCGUUA 


1840 1 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 





UGAAGCACCUUGAUCUUUG 


1847 


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU 


1852 


GCUCACUCCUUUCUACUAA 


1853 


UGAAGCACCUUGAUCUUUG 


1854 



GCACAGAUAUUUCCAUUUC 


1855 


GGAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 





UAAGAACACUGCUCAUGAA 


i 1859 


GAGGCAUCCUGAAGCAUUA 


1860 


GAAGGCGGCUGCAGAACUG 


1861 


GGAACAAGGAGAUCAUGUA 


1862 



IKBKG 








CUAUCGAGGUCGUUAAAUU 


1863 




GAAUGCAGCUGGAAGAUCU 


1864 




GCGGCGAGCUGGACUGUUU 


1865 




CCAGACCGAUGUGUAUUUA . 


1866 








TNFRSF 
5 








GGUCUCACCUCGCUAUGGU 


1867 




GAAAGCGAAUUCCUAGACA 


1868 




GCACAAACAAGACUGAUGU 


1869 




GAAGGGCACCUCAGAAACA 


1870 




UCUCCCAACUUGUAUUAAA 


1871 








RELA 








UCAAGUGUCUUCCAUCAUG 


1 1872 




UCAAGUGCCUUAAUAGUAG 


1 1873 
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ARHA 



CDC42 



ROCK1 



PAK1 



PAK2 



GGAGUACCCUGAGGCUAUA 


1874 


GAUGAGAUCUUGCUACUGU 


1875 ^ 








GAGCUGGGCUAAGUAAAUA 


1876 


GACCAAAGAUGGAGUGAGA 


1877' 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUAGUUUAUAA 


1879 








GGACAUUUGUUUGCCAUUU 


1880 


GGAGAACCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUAGAG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 








GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG 


1888 


GACAAGAGAUUACAGAUAA 


1889 


GAAGAAACAUUCCCUAUUC 


1890 








GAGGGUGGUUUAUGAUUAA 


1891 


CAACAAAGAACAAUCACUA 


1892 


GAAGAAAUAUACACGGUUU 


1893 


UACAUGAGCUUUACAGAUA 


1894 








GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAAGUGAUCAUUAA 


1896 


GUAGAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACCGUUA 


1898 



PAK3 



GAUUAUCGGUGCAAAGGAA- - . 


1899 


GAGAGUGGCUGCAAGCUUO 


1900 


GACAAGAGGUG6CCAUAAA 


1901 


UUAAAUCGCUGUCUUGAGA 


1902 



PAK4 



PAK6 



PAK7 







ACUAAGAGGUGAACAUGUA 


1903 


GAUCAUGAAUGUCCGAAGA 


1904 


GAUGAGACCCUACUACUGA 


1905 


CAGCAAAGGUGGCAAAGAU 


1906 








UAAAGGGAGUUGUCCACUA 


1907 


GAAGGGACCUGCUUUCUUG 


1908 


GCAAAGACGUCCCUAAGAG 


1909 


CCAAUGGGGUGGCUGCAAA 


1910 








1 GAGCACGGCUUUAAUAAGU 


\ 1911 
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HDAC1 



CAAACUCCGUUAUGAUAUA j 


1912 


GGAUAAAGUUGUCUGAUUU 


1913 ' 


GGAAAUGCCUCCAUAAAUA 


1914 








GGACAUGGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


GGACAAGGGCACCCAAUGA 


^ 1917 


CCACAGCGAUGACUACAUU 


1918 



HDAC2 











GCUGUUAAAUUAUGGCUUA 1 


1 1919 


GCAAAGAAAGCUAGAAUUG 


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


CCAAUGAGUUGCCAUAUAA 


1922 



CREBBP 



BTRC 







GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGGCGUUUACCAUGA 


1924 


GGACAGCCCUUUAGUCAAG 


1925 


GAACUGAUUCCUGAAAUAA 


1926 









CACAUAAACUCGUAUCUUA 


1927 


GAGAAGGCACUCAAGUUUA 


1928 


AGACAUAGUUUACAGAGAA 


1929 


GCAGAGAGAUUUCAUAACU 


1930 



RIPK2 



VAV1 



GGACAUCGACCUGUUAUUA 


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAACAUA 


1934 



GCAGAAAUACAUCUACUAA 


1935 


GCUAUGAGCUGUUCUUCAA 


1936 


CGAGAAAGGUeUACUCAUC . 


1937 


GCUCAAGCGUGGAGACAUU 


1938 



VAV2 



GRB2 



GCUGAGCGCUUUGCAAUAA 


1940 


CAAGAAGUGUCAGGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 







UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAGUGCAUGACGUUUA 


1943 


UGAAUGAGCUGGUGGAUUA 


1944 


UGCCAAAACUUACCUAUAA 


1945 



PLCG1 








GAGCUGCAGUCCAAUGAGA 


1946 




GAAACCAAGCCAUUAAUGA 


1947 




CCAAGGAGCUACUGACAUU 


1948 
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t- 



ITGB1 



ITGA4 



STAT1 



AGAGAAACAUGGCCCAAUA 


1949 










CCACAGACAUUUACAUUAA 


1950 


j GAAGGGAGUUUGCUAAAUU 


1951 


[GAACAGAUCUGAUGAAUGA . 


1952 


1 CAAGAGAGCUGAAGACUAU 


1953 












GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU 


1957 






AGAAAGAGCUUGACAGUAA 


1958 


GGAAGUAGUUCACAAAAUA 


1959 


UGAAGUAUCUGUAUCCAAA 


'~ 1960 


GAGCUUCACUCGCUUAGUU 


1961 



KRAS2 







UAAGGACUCUGAAGAUGUA 


1962 


GAGAAAGUGUGUAAUUAUG 


1963 


GCUCAGGACUUAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAUACCUAAGAU 


1966 


UAAGGACUCUGAAGAUGUA 


1967 


GAGAAAGUGUGUAAUUAUG 


1968 


GCUCAGGACUUAGCAAGAA 


1969 



HRAS 





RH 




m 


CCAUCCAGCUGAUCCAGAA 


1970 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 



BflAF 



ELK1 



RALGDS 



PRKCA 





GAUUAGAGACCAAGGAUUU 




CCACUGAUGUGUGUUAAUU . 


1973 


CAAUAGAACCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1975 








GAUGUGAGUAGAAGAGUUA 


1975 


GGAAGAAUUUGUAGCAUUU 


1976 


GAACGACCUUUCUUUCUUU 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 








GGAGAAGCCUCACCUCUUG 


1979 


GCAGAAAGGACUCAAGAUU 


1980 


GAGAACAACUACUCAUUGA 


1981 


GAACUUCUCGUCACUGUAU 


1982 








GGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUCUCGUAUGUCA 


1 1984 
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GAAGAAGGAUGUGGUGAUU 


1985 


ggacugggaOggaacaaca ^ 


1986 










ggacagaaguggaaauauu 


1987 


ucaaagaggugaagauuaa 


1988 


GACCAAAUCUCAGUUGUUU 


1989 


ggagaauggugcuguuum 


1990 







gaagagaccaaaguauaau 


1991 


gaagaggggccacgucauu 


1992 


GGAAGAG ACCAAAG U AU AA 


1993 


ggauugagauugacgagaa 


1994 


ugagagaacgagaaaguug 


1995 


gugaaacccugucugcauu 


1996 


ggaugucucucaacaagua 


1997 


agaaguaggugaacacaua 


r 1998 








ugagaguccugaaacgaua 


1999 


GAUUGGAGAUUCUACAUUC 


2000 


ggucauggaugcaaaucuu 


2001 


GAAGGUAAGCGGACCAUUU 


2002 



MAPK9 



AAAGAGAGGUUAUCGUGAA 


2003 


GAUGAUAGGUUAGAAAUAG 


2004 


ACAAAGAAGUCAUGGAUUG 


2005 


GGAGGUGGAUCAUGAAAGA 


2006 







GAAAAGGGAUGAUGGGAUU 


2007 


GGUAGACGAUGGCAAAUAU 


2008 


GAGGGAAACCAGGGAUUUA 


2009 


U6AAACGAAUGGUGGAGAA 


2010 


UGACUCACCCAGAGAAAUA 


2011 


CGAAGAAAGCUAUCUCUGA 


[ .. 2012 


AGACUCACCUAGAGCUAAA 


2013 



BBC3 



CCUGGAGGGUCCUGUACAA 


2014 


GAGCAAAUGAGCCAAAC6U 


2015 


GGAGGGUCCUGUACAAUCU 


2016 


GACUUUGUCUGCACCAUGU 


2017 







CCAGGGAGCUUGAAAGUUU 


2018 


AAAGUGCAGUUCAGUAAUA 


2019 


GAGAAUCACUAACCAGAGA ■ 


2020 


GAGCCCAUCCCUAUUAUAA 


2021 



BCL2L11 







GAGACGAGUUUAACGCUUA 


2022 


AAAGCAACCUUGUGAUGUA 


2023 


GCGAGAAGGUAGACAAUUG 


2024 



BID 



GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGGCAGAAGGUAA 


2026 


GCAAGGAGGUUAGAGAAAU 


2027 


CAAGGAGGUUAGAGAAAUA 


2028 


UGUUACGACUGUUACGUUA 


2029 








GAAGACAUCAUCCGGAAUA 


2030 


CAACAGCGUUCCUAGAGAA 


2031 


GAAAUGGGAUGGACUGAAC 


2032 


ACGAUGAGGUGCAGACUGA 


2033 



BIRG2 



BIRC3 







GAAAGAAGCCUGCAUAUAA 


2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA 


2036 


GAAUACACCUGUGGUUAAA 


2037 






GGAGAUGCCUGCCAOUAAA 


2038 


UGAAUGAUCUUGUGUUAGA 


2039 


GAAAGAACAUGUAAAGUGU 


2040 


GAAGAAAGAACAUGUAAAG 


2041 



BIRC4 



BIRC5 



BRCA1 



CARD4 



GASP10 







GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGCUAACUGAUUG I 


2043 


GAGGAACCCUGCCAUGUAU . i 


2044 


GCACGGAUCUUUACUUUUG ^ 1 


2045 






GGCGUAAGAUGAUGGAUUU 


2046 


GCAAAGGAAACCAACAAUA 


2047 


GCACAAAGCCAUUCUAAGU 


2048 


CAAAGGAAACCAACAAUAA 


2049 








CCAUACAGCUUCAUAAAUA 


. - - 2050 


GAAGAGAACUUAUCUAGUG 


2051 


GAAGUGGGCUGCAGUAUUA 


2052 


GCAAGAUGCUGAUUCAUUA 


2053 


CCAUACAGCUUCAUAAAUA 


2054 








GAAAGUUAAUGUCAAGGAA 


2055 


GAGCAACACUGGCAUAACA 


2056 


UAACAGAGAUUUGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


2058 








CAAAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


i 2060 


GAAGGCAGCUGGUAUAUUC 


2061 


GACAUGAUCUUCCUUCUGA 


2062 


GCACUCUUCUGUUCCCUUA 


2063 
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CASP2 



1 GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


"CAACUUCCCUGAUCUUUAA 


2066 ' 


GCUCAAAGAUGUAAUGUAG 


2067 



CDKN1A 





GAACAAGGAGUCAGACAUU 


2068 


AAACUAGGCGGUUGAAUGA 


2069 ! 


GAUGGAACUUCGACUUUGU 


2070 


J3UAAACAGAUGGCACUUUG 


2071 



CFLAR 







GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA 


2073 


GAGCAUAGCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 



CLK2 







GUGAAUAUGUGAAAUAGUG 


2076 


^AAAGCAUGCUAGAGUAUGA 


2077 


UUAAGAAUGUGGAGAAGUA 


2078 


GAUAACAAGCUGACACAUA 


2079 



CLSPN 



GCAGAUGGGUUCU UAAAUG 


2081 


CAAAUGAGGUUGAGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 



CSNK2A 
1 




GCAUUUAGGUGGAGACUUC 


2085 j 


GAUGUACGAUUAUAGUUUG 


2086 j 


UGAAUUAGAUCCAGGUUUC 


2087 1 



CTNNB1 



CXCR4 



CXCR6 





GCACAAGAAUGGAUCACAA 1 


i 2088 


GGUGAAACAUGCAGUUGUA 


2089 


GUACGUACCAUGCAGAAUA 


2090 


GAACUUGCAUUGUGAUUGG 


2091 








GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCGAGAGCGUGUA 


2093 


GUUCUUAGUUGCUGUAUGU 


2094 


CAUCAUGGUUGGCCUUAUC 


2095 1 






GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


GGGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUGUCCUUGA 


2099 



DAXX 



CUACAGAUCUCCAAUGAAA 


2100 ' 


fGCUACAAGCUGGAGAAUGA 


2101 ' 


GGAAACAGCUAUGUGGAAA 


2102 


GGAGUUGGAUCUGUCAGAA 


2103 



GAS41 



GUAGUAAGCUAAACUGAAA 



GACAAUAUGUUCAAGAGAA 



GACAACAUCUCGUCAGCUA 



UAUAUGAUGUGUGGAGUAA 



2104 



2105 



2106 



2107 



GTSE1 







CAAAGAAGCUCACUUACUG 


2108 


GAACAGCCCUAAAGUGGUU 


2109 


GAACAUGGAUGACGCUAAG 


2110 


, GGGCAAAGGUAAAUCAAGU 


2111 



HDAC3 



HDAC5 







GGAAAGCGAUGUGGAGAUU 


2112 ^1 


CCAAGACCGUGGCCUAUUU 


2113 j 


AAAGCGAUGUGG AGAUU UA 


2114 1 


GUGAGGAGCUUCCCUAUAG 


2115 









GAAUUCCUCUUGUCGAAGU 


2116 


GUUAUUAGCACCUUUAAGA 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GGAUAUGGAUUUCAGUUAA 


2120 


GGAAGUCGGUGCCUUGGUU 


2121 


GGAAGGAGAGGACUGGUUU 


2122 



HEC 



IGCAGAUACUUGCACGGUUU 



GAGUAGAACUAGAAUGUGA 



I GCGAAUAAAUCAUGAAAGA 



I GAAGAUGGAAUUAUGCAUA 



2123 



2124 



2125 



2126 



HIST1H2 




AA 


GGCAAUGCGUCUCGCGAUA 


2127 




GAUCGGCAAUGAUGAGGAA 


2128 




GGAAUGCGUCUCGCGAUAA 


2129 




GAGGAACUCAAUAAGCUUU 


2130 



LMNB1 



LMNB2 







AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCUUAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 








GGAGGUUCAUUGAGAAUUG 


2134 


GGCAAUAGCUCACCGUUUA 


2135 


CAAAUAGGCUUAGGUGUGU 


2136 


GGAGAUCGCCUAGAAGUUC 


2137 
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MYB 



MYT1 







GCAGAAACACUCCAAUUUA 


2138 


GUAAAUACGUGAAUGCAUU 


2139 


GCACUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG 


2141 








GAGGUGAGGU6UUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


2144 


CAACUUCUCUCGUGAACUU 


2145 



GGACACGGCACUGCACUUG 



2146 



GCACUUGGCUGUGAUUCAU 



2148 



GAGACGAGGGCGAUGAAUA 



2149 



CAUGAACCCUUGCUGGAUU 



2150 



NFKBIA 



GAACAUGGACUUGUAUAUU 


2151 


GAUGUGGGGUGAAAAGUUA 


2152 ' 


GGACGAGAAAGAUCAUUGA 


2153 


AGGACGAGCUGCCCUAUGA 


2154 



NFKBIE 



NUMA1 



GAAGGGAAGUUUCAGUAAC 


2155 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAG 


2157 


GAACCAACCACUCAUGGAA 


2158 








gggaacaguuug/\a^Ijm^^^i 


2159 


GCAGUAGCGUGAAGCAGAA 


2160 


CGAGAAGGAUGCACAGAUA 


2161 


GCAA6AGGCUGAGAGGAAA 


2162 







GAAGACAAAUGAAAGCUAA 


2163 


GAUAAAGACUGCUGUUAGA 


2164 


GAGGAGAGCUCUAAUAUUA 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 



0PA1 







GAAAGAGCAUGAUGACAUA 


2167 


GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


AAAGAAGGCUGUAGCGUUA 


2170 



PARVA 



CUACAUGUCUUU6CUCUUA 



GCUAAGUCCUGUAAGAAUA 


2172 


CAAAGGCAAUGUAGUGUUU 


2173 


GAAGAAUGGUGGAUCCAAA 


2174 



2171 



PIK3CG 




AAGUUCAGGUUGUCUAUUA 



PRKDC 



RASA1 



GAAGAAAUCUCUGAUGGAU 


2176 


' GAACACCUUUACUCUAUAA 


2177 ] 


GCAUGGAGCUGGAGAACUA 


2178 ! 








GAUGAAAGCUCUAAAGAUG 


2179 


GAAAGGAGGUUCUAAACUA 


2180 


GGAAGAAGCUCAUUUGAUU 


2181 


GCAAAGAGGUGGCAGUUAA 


2182 








GGAAGAAGAUCCACAUGAA 


2183 


GAACAUACUUUCAGAGCUU 


2184 


GAACAAUCUUUGCUGUAUA 


2185 


UAACAGAACUGCUUCAACA 


2186 



SLC9A1 









GAAGAGAUCCACACACAGU 


2187 


UCAAUGAGCUGCUGCACAU 


2188 


GAAGAUAGGUUUCCAUGUG 


2189 


GAAUUACCCUUCCUCAUCU 


2190 



TEGT 







CUACAGAGCUUCAGUGUGA 


2191 


GAACAUAUUUGAUCGAAAG 


2192 


GAGCAAACCUAGAUAAGGA 


2193 


GGAUUGAUGUCUUGUUAGA ' 


2194 



TERT 





M 






GGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGGUGAGGUGGAAGAUGA 


2197 


GAAGGGGGCUGGAACCAUA 


2198 



TNFRSF 
6 



TOP1 



T0P2A 



T0P3A 







GAUACUAACUGCUCUCAGA 


2199 


GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCGCAUGUAA 


2201 


UCAUGAAUCUCCAACCUUA 


2202 


GAUGUUGACUUGAGUAAAU 


2203 






GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAUAAGUGGAAAGAAG 


2207 








GAAAGAGUCCAUCAGAUUU 


2208 


CAAACUACAUUGGCAUUUA 


2209 


AAACAGACAUGGAUGGAUA 


2210 


CGAAAGGAAUGGUUAACUA 


1 2211 




GCAGAAAUCUUCCACAGAA 


2212 



T0P3B 



GAAACUAUCUGGAUGUGUA 


2213 


CCACAAAGAUGGUAUCGUA 


2214 


GGAAAUGGCUGUGGUAACA 


2215 








GAGACAAGAUGAAGACUGU 


2216 


GCACAUGGGCUGCGUCUUU 


2217 


CCAGUGCGGUUCAAGAUGA 


2218 


GAACAUCUGCUUUGAGGUU 


2219 



WEE1 







GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


2222 


GCUGUAAACUUGUAGCAUU 


2223 



In addition, to identifying functional siRNA against gene families or 
pathways, it is possible to design duplexes against genes known to be involved in 
5 specific diseases. For example when dealing with human disorders associated with 
allergies, it will be beneficial to develop siRNA against a number of genes including 
but not limited to: 
the interleukin 4 receptor gene 
(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU, 
10 SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA, 
SEQ. ID NO. 2225: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 

the Beta-arrestin-2 
15 (SEQ.IDN0.2227:GGAUGAAGGAUGACGACUA, 
SEQ. ID NO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. ro NO. 2230: GAUGAAGGAUGACGACUAU, ), 

20 flie intearferon-gamma recq>tor 1 gene 

(SEQ. ID NO. 2231 : CAGCAUGGCUCUCCUCUUU, 
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA, 
SEQ. ID NO. 2233: GAAACUACCUGUUACAUUA, 
SEQ. ID NO. 2234: GAAGUGAGAUCCAGUAUAA), 

25 
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the matrix metalloproteinase MMP-9 
(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. ID NO. 2236: GUUGGAGUGUUUCUAAUAA, 
SEQ: id no. 2237: GCGCUGGGCUUAGAUCAUU, 
5 SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA), 

the Slcl lal (Nrampl) gene 

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA, 
SEQ. ID NO. 2240: GGGCCUGGCUUCCUCAUGA, 
10 SEQ. ID NO. 2241 : GGGCAGAGCUCCACCAUGA, 
SEQ. ID NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPINK5 

(SEQ. ID NO. 2243: CCAACUGCCUGUUCAAUAA, 
1 5 SEQ. ID NO. 2244: GGAUACAUGUGAUGAGUUU, 
SEQ. ID NO. 2245: GGACGAAUGUGCUGAGUAU, 
SEQ. ID NO. 2246: GAGCUUGUCUUAUUUGCUA,), 

the CYP1A2 gene 
20 (SEQ. ID NO. 2247: GAAAUGCSGUGUCUUCGUA, 
SEQ. ro NO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. ID NO. 2249: GAAGACACCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGCUUGACCUUCA), 

25 thymosin-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU, 
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA, 
SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU, 
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG, 

30 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC, 



SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. IDNO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
5 SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA, 
SEQ. IDNO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

10 ADAM33 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA, 
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA, 
SEQ. IDNO. 2266: GGUGAGAGGUAGCUCCUAA, 
SEQ. ID NO. 2267: AAAGACAGGUGGCCACUGA), 

15 

the TAP 1 gene 

(SEQ. ID NO. 2268 : GAAAGAUGAUCAGCUAUUU, 
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA, 
SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU, 
20 SEQ. ID NO. 2271 : UACCUUCAGUGGAAACUUA, 

COX-2 

(SEQ. IDNO. 2272: GAACGAAAGUAAAGAUGUU, 
SEQ. ID NO. 2273: GGACUUAUGGGUAAUGUUA, 
25 SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

ADPRT 

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU, 
30 SEQ. ID NO. 2277: CAUGGGAGCUCUUGAAAUA, 
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG, 
SEQ. ID NO. 2279: UGAAGAAGCUCACAGUAAA, ), 



HDC 
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(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA, 
SEQ. ID NO. 2281: GGAGAGAGAUGGUGGAUUA, 
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA, 
SEQ. ID NO. 2283: GAACGUCCCUUCAGUCUGU), 

5 

HmnT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
SEQ. ID NO. 2286: GAGCAGAGCUUGGGAAAGA, 
10 SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. ID NO. 2288: GAACUGCUUUCUUUCGXJUU, 
SEQ. ID NO. 2289: GCAGUAUCAUGAAGCCUAA, 
1 5 SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

(SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU, 
20 SEQ. ID NO. 2293: GGACUUAGAUGCeCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 22^5: GGACUAGCCCUQCUGUUUA), and 

STAT6 

25 (SEQ. ID NO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. ID NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA). 

30 In addition, rationally designed siRNA or siRNA pools can be directed against 

genes involved in anemia, hemophila or hypercholesterolemia. Such g^es would 
include, but are not be limited to: 
AP0A5 

(SEQ. ID NO. 2300: GAAAGACAGCCUUGAGCAA, 
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SEQ. ID NO. 2301 : GGACAGGGAGGCCACCAAA, 
SEQ. ID NO. 2302: GGACGAGGCUUGGGCUUUG, 
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU), 

5 HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ED NO. 2306: UAUCAGAGCUCUUAAUGUU, 
SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA), 

10 

N0S3 

(SEQ. ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
SEQ. ID NO. 2310: GGAAGAAGACCUUUAAAGA, 
15 SEQ. ID NO. 2309: GCACAAGAGUUAUAAG AUG), 

ARH 

(SEQ. ID NO. 2310: CGAUACAGCUUGGCACUUU, 
SEQ. ID NO. 2311: GAGAAGCGCUGCCCUGUGA, 
20 SEQ. ID NO. 23 12: GAAUCAUGCUGUUC13fClJUU, 
SEQ. ID NO. 2313: GGAGUAACCGGACACCUUA), 

CYP7A1 

(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
25 SEQ. ID NO. 2315: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 

FANCA 

30 (SEQ. ID NO. 23 1 8: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
SEQ. ID NO. 2321: GGACAGAGGCAGAUAAGAA), 
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FANCG 

(SEQ. ID NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. IDNO. 2323: GCAAGCAGGUGCCUACAGA, 
SEQ. ID NO. 2324: GGAAUUAGAUGCUCCAUUG, 
5 SEQ. ID NO. 2325: GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: CAAUAUGCCUGGAAACUAU, 
SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC, 
1 0 SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 

PIGA 

(SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA, 
1 5 SEQ. ID NO. 233 1 : GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

factor VIII 

20 (SEQ. ID NO. 2334: GCAAAUAGAUCUCCAUOAC, 

SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 

SEQ. ID NO. 2336: GAAAGGCUGUGCUCUCAAA, 

SEQ. IDNO. 2337: GGAGAAACCUGCAUGAAAG, 

SEQ. IDNO. 2338; CUUGAAGCCUCCUGAAUUA, 
25 SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 

SEQ. ID NO. 2340: GAUAGGAGAUACAAACUUU). 

FurthoTOore, rationally designed siRNA or siRNA pools can be directed 
against genes involved in disorders of the brain and nervotis system. Such genes 
30 would include, but are not be limited to: 
APBBl 

(SEQ. ID NO. 2341 : CUACGUAGCUCGUGAUAAG, 
SEQ. IDNO. 2342: GCAGAGAUGUCCACACGUU, 
SEQ. ID NO. 2343 : CAUGAGAUCUGCXJCUAAGA, 



SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG), 



BACEl 

(SEQ. ID NO. 2345: CCACAGAGCAAGUGAUUUA, 
5 SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA, 
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA, 
SEQ. ID Np. 2348: UGUCAGAGCUUGAUUAGAA), 

PSENl 

10 (SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG, 
SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA, 
SEQ. ID NO. 2351: GAGGUUAGGUGAAGUGGUU, 
SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA, 
SEQ. ID NO. 2353: GAACUGGAGUGGAGUAGGA, 

15 SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 
SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 

PSEN2 

(SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA, 
20 SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUA^^ 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA), 

MASSl 

25 (SEQ. ID NO. 2360: GGAAGGAGCUGUUAUGAGA, 
SEQ. ID NO. 2361 : GAAAGGAGAAGCUAAAUUA, 
SEQ. ID NO. 2362: GGAGGAAGGUCAAGAUUUA, 
SEQ. ID NO. 2363: GGAAAUAGCUGAGAUAAUG, ), 

30 ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
SEQ. ID NO. 2366: CGCCUGAUAUUGAAGUAAA, 
SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and 



NNMT 

(SEQ. ID NO. 2368: GGGCAGUGCUCCAGUGGUA, 
SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA, 
5 SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA, 
SEQ. ID NO. 2371 : GAGGUGAUCUCGCAAAGUU). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
genes involved in hypertension and related disorders. Such genes would include, but 
10 are not be limited to: 

angiotensin II type 1 receptor 

(SEQ. ID NO. 2372: CAAGAAGCCUGCACCAUGU, 
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2374: GCACXJGGUCCCAAGUAGUA, 
15 SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 

20 AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID NO. 2381: GCAAGCAUCUUAUAUAGUU, 

SEQ. ID NO. 2382: GAACCAGUCUUUCAACUCA), and other related targets. 

25 

Example Xni: Validation of Multigene Knockout using Rab5 and Eps 

Two or more genes having similar, overlapping functions often leads to 
genetic redundancy. Mutations that knockout only one of, e.g., a pair of such genes 
(also referred to as homologs) results in little or no phenotype due to the fact that the 
30 remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To 
fully understand the function of such genes in cellular physiology, it is often 
necessary to knockout or knockdown both homologs simultaneously. Unfortunately, 
concomitant knockdown of two or more genes is firequently difficult to achieve in 
higher organisms {e.g. mice) thus it is necessary to introduce new technologies dissect 
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gene function. One such approach to knocking down multiple genes simultaneously 
is by using siRNA. For example, Figure 11 showed that rationally designed siRNA 
directed against a number of genes involved in the clathrin-mediated endocytosis 
pathway resulted in significant levels of protein reduction (e.g. >80%). To determine 
5 the effects of gene knockdown on clathrin-related endocytosis, internalization assays 
were performed using epidermal growth factor and transferrin. Specifically, mouse 
receptor-grade EGF (Collaborative Research Inc.) and iron-saturated human 
transferrin (Sigma) were iodinated as described previously (Jiang, X., Huang, P., 
Marusyk, A. & Sorkin, A. (2003) MolBiol Cell 14, 858-70). HeLa cells grown in 12- 

10 well dishes were incubated with '^^I-EGF (1 ng/ml) or *^^I-transferrin (1 |ng/ml) in 
binding medium (DMEM, 0.1% bovine serum albumin) at and the ratio of 
internalized and surface radioactivity was determined during 5-min time coxirse to 
calculate specific internalization rate constant ke as described previously (Jiang, X et 
fl/.).^The measurements of the uptakes of radiolabeled transferrin and EGF were 

1 5 performed using short time-coxirse assays to avoid influence of the recycling on the 
uptake kinetics, and using low ligand concentration to avoid saturation of the clathrin- 
dependent pathway (for EGF Lund, K. A., Opresko, L. K., Strarbuck, C, Walsh, B. J. 
& Wiley, H. S. (1990) J. Biol Chem. 265, 15713-13723). 

20 The effects of knocking down RabSa, 5b, 5c, Eps, of Bps 1 5R (individually) 

are shown in Figure 22 and demonstrate that disruption of single genes has little or no 
effect on EGF or Tfii internalization. In contrast, simultaneous knock down of Rab5a, 
5b, and 5c, or Eps and Eps 1 5R, leads to a distinct phenotype (note: total 
concentration of siRNA in these experiments remained constant with that in 

25 expmments in which a single siRNA was introduced, see Figure 23). These 

experiments demonstrate the effectiveness of using rationally designed siRNA to 
knockdown multiple genes and validates the utility of these reagents to override 
genetic redundancy. 

30 Example XIV. VaUdation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

Further demonstration of the ability to knock down expression of multiple 
genes using rationally designed siRNA was performed using pools of siRNA directed 
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against four separate genes. To achieve this, siRNA were transfected into cells (total 
siRNA concentration of lOOnM) and assayed twenty-four hours later by B-DNA. 
Results shown in Figure 24 show that pools of rationally designed molecules are 
capable of simultaneously silencing four different genes. 

5 

Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 

Expression Profiling, a Prophetic Example 

To fiirther demonstrate the abihty to concomitantly knockdown the expression 

of multiple gene targets, single siRNA or siRNA pools directed against a collection of 
10 genes {e.g. 4, 8, 16, or 23 diflfereht targets) are simultaneously transfected into cells 

and ciiltured for twenty-four hours. Subsequently, mRNA is harvested from treated 

(and untreated) cells and labeled with one of two fluorescent probes dyes (e.g. a red 

fluorescent probe for the treated cells, a green fluorescent probe for the control cells.). 

Equivalent amounts of labeled RNA from each sample is then mixed together and 
1 5 hybridized to sequences that have been linked to a solid support (e.g, a slide, "DNA 

CHIP"). Following hybridization, the slides are washed and analyzed to assess 

changes in the levels of target genes induced by siRNA. 

Example XVI. Identifying Hyperfuuctional siRNA 

20 

Identification of Hyperfunctional Bcl-2 siRNA 

The ten rationally designed Bcl2 siRNA (identified in Figure 13, 14) were 
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 
siRNA were individually transfected into cells at a 300pM (0.3nM) concentrations. 
25 Twenty-four hours later, transcript levels were assessed by B-DNA assays and 

compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2 
siRNA failed to induce functional levels of silencing at this concentration, siRNA 1 
and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at 
this subnanomolar concentration. 

30 

By way of prophetic examples, similar assays could be performed with any of 
the groups of rationally designed genes described in Example VII or Example VIIL 
Thus for instance, rationally designed siRNA sequences directed against 
PDGFA 
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(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGCAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or 

5 

PDGFB 

(SEQ. ID NO. 2387: GCUCCGCGCUUUCCGAUUU, 
SEQ. ID NO. 2388: GAGCAGGAAUGGUGAGAUG, 
SEQ. ID NO. 2389: GAACUUGGGAUAAGAGUGU, 
10 SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 
SEQ. ID NO. 2391: UUUAUGAGAUGCXJGAGUGA) 

could be introduced into cells at increasingly limiting concentrations to determine 
whether any of the diq)lexes are hyperfiinctional. Similarly, rationally designed 
sequences directed against 
15 HIFl alpha.. 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ID NO. 2393 : GCAUAUAUCUAGA AGGUAU, 
SEQ. ID NO. 2394: GAACAAAUACAUGGGAUUA, 
SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or 

20 

VEGF 

(SEQ. IDNO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID N0..2391i<iAGAAAGCAUUUGUUUGUA, 
SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 
25 SEQ. ID NO; 2399: CGAGGCAGCUUGAGUUAAA) could be introduced into cells 
at increasingly hmiting concentrations and screened for hyperfiinctional duplexes. 

Example XVII; Gene Silencing: Prophetic Example 

Below is an example of how one might transfect a cell. 
30 a. Select a cell line. The selection of a cell line is usually determined by the 
desired application. The most important feature to RNAi is the level of 
expression of the gene of interest. It is highly recommended to use cell lines 
for which siRNA transfection conditions have been specified and validated. 
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b. Plate the cells. Approximately 24 hours prior to transfection, plate the cells at 
the appropriate density so that they will be approximately 70 - 90% confluent, 
or approximately 1 x 10^ cells/ml at the time of transfection. Cell densities 
that are too low may lead to toxicity due to excess exposure and uptake of 
transfection reagent-siRNA complexes. Cell densities that are too high may 
lead to low transfection efficiencies and little or no silencing. Incubate the 
cells overnight. Standard incubation conditions for mammalian cells are 3TC 
in 5% CO2. Other cell types, such as insect cells, requure different 
temperatures and CO2 concentrations that are readily ascertainable by persons 
skilled in the art. Use conditions appropriate for the cell type of interest. 

c. SiRNA re-suspension. Add 20 jil siRNA universal buffer to each siRNA to 
generate a final concentration of 50 jiM. 

d. SiRNA-lipid complex formation. Use RNase-firee solutions and tubes. Using 
the following table, Table XI: 
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e. 



Table XI 








Mixture 1 (TransIT-TKO-Plasmid dilution mixture) 


Opti-MEM 


9.3 ul 


46.5 Hi 


TransIT-TKO{l ng/^l) 


0.5 Hi 


2.5 Hi 















Mixture 2 (siRNA dilution mixture) 


Opti-MEM 


9.0 M 


45.0 Hi 


siRNA (1 ^M) 


1.0 ul 


5.0 ul 










Mixture 3 (siRNA-Transfection reagent mixture) 


Mixture 1 


10^1 


50 Hi 


Mixture 2 


lOixI 


50 ul 










Incubate 20 minutes at room temperature. 


Mixture 4 (Media-siRNA/Iransfection reagent mixture) 


Mixture 3 


20 ix\ 


100 ul 


Complete media 


80 ut 


400 111 










Incubate 48 hours at 3TC. 



5 Transfection. Create a Mixture 1 by combining the specified amoimts of OPTI-MEM 
serum free media and transfection reagent in a sterile polystyrene tube. Create a 
Mixture 2 by combining specified amounts of each siRNA with OPTI-MEM media in 
sterile 1 ml tubes. Create a Mixture 3 by combining specified amounts of Mixture 1 
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 

1 0 minutes. Create a Mixture 4 by combining specified amounts of Mixture 3 to 

complete media. Add appropriate volume to each cell culture well. Incubate cells 
with transfection reagent mixture for 24 - 72 hours at 37°C. This incubation time is 
flexible. The ratio of silencing will remain consistent at any point in the time period. 
Assay for gene silencing using an appropriate detection method such as RT-PCR, 

1 5 Western blot analysis, immunohistochemistry, phenotypic analysis, mass 



spectrometry, fluorescence, radioactive decay, or any other method that is now known 
or that comes to be known to persons skilled in the art and that from reading this 
disclosure would useful with the present invention. The optimal window for 
observing a knockdown phenotype is related to the mRNA turnover of the gene of 
5 interest, although 24 - 72 hours is standard. Final Volume reflects amount needed in 
each well for the desired cell culture format. When adjusting volumes for a Stock 
Mix, an additional 10% should be used to accommodate variability in pipetting, eta 
Duplicate or tripHcate assays should be carried out when possible. 

1 0 While the invention has been described in connection with specific 

embodiments thereof, it will be understood that it is capable of further modifications 
and this applicatioh is intended to cover any variations, uses, or adaptations of the 
invention following, in general, the principles of the invention and including such 
departure from the present disclosure as come within known or customary practice 

1 5 within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth and as follows in the scope of the appended claims. 
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Claims 



1 . A method for selecting siRNA comprising selecting an siRNA molecule of 
1 9 - 25 nucleoside bases, said method comprising: 
(a) selecting a target gene; 
5 (b) measuring the functionality of sequences of 1 9 - 25 nucleotides in 

length that are substantially complementary to a stretch of nucleotides 
of the target sequence, wherein said fiinctionality is dependent upon 
non-target specific criteria. 

10 2. The method according to claim 1 wherein said functionality is determined 
by applying one of the follo wing formulas: 

Formula I = -(Ga3) +(AUi5.i9) -(Tm20'c)*3 -(Gi3)*3 -(C19) +(Ai9)*2 +(A3) 
+(Uio)+(Ai4)-(U5)-<A„); 

15 

Formula II = -(GC/3) -(AU,5.i9)*3 -(G,3)*3 -(C19) +(A,9)*2 +(A3); 

Formula III = -(GC/3) +(AUi5.i9) -(Tm2ox)*3; 

20 Formula IV = -(GC/2)+(AU,5-i9)/2-(Tm2o»c)*2 -{Gi3)*3 -<C,9) +(A,9)*2 

+(A3) +(U,o)+(A,4) -<U5) -(An); 

Formula V = -(Gi3)'^3^aj^^XAi9)*2 +(A3) + (Uio)+(A,4) -(Us) -(A„); 
25 Formula VI = -(Gi3)*3 -(C19) +(A,9)*2 +(A3); 

Fonnula VII = -(GC/2) +(AUi5-i9)/2 -( Tm2o°c)*l -<G,3)*3 -(C19) +(A,9)*3 
-KA3)*3 +(Uio)/2+(Ai4)/2-(U5)/2-(A„)/2; 



30 



wherein in Formulas I - VII: 



AU15.19 = 0-5 depending on the number of A or U bases on the sense 
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strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 
value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 
5 value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 2o-c= 1 if the Tm is greater than 20''C; 

A3 = 1 if A is the base at position 3 on flie sense strand, otherwise its 
value is 0; 

1 0 An = 1 if A is the base at position 1 1 on the sense strand, otherwise its 

value is 0; 

Ai4 = 1 if A is the base at position 14 on tfie sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
15 value is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Uio = 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 

20 or, 

Formula VIII: (-14)*Gu-13*Airl2iayW4*U2-10* 

9*Aio-9*U9-9*C,8.8*Gi(r7*Ui-7*Ui6-7*C,7-7*C,9 
25 +7*Un+8*A2+8*A4 +8*A5+8*C4 +9*G8 +10*A7+10*Ui8+l 1 *Ai9+ . 

ll*C9+15*Gi+ 18*A3+19*Uio-Tm-3* (GCtota!)-6*(GC,5-i9)- 
30*X; and 

Formula IX: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*U,o+(10.5)* 
30 C9+(23.9)*G,+(163)*G2+(-123)*AiiH<.193)*Ui+(-12J)*^^^ 
(-1 l)*U3+(-15.2)*U,5+(-l L3)*U,6+ (-1 1.8)*C3+(-17.4)*C6+(- 
ia5)*C7+ (-13.7)*Gi3-i-(-25.9)*Gi<r-Tm-3* (GQotaO - 6*(GC,5-i9> 
30*X 

wherein 
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Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
5 A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

A6= 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7= 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

All = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0; 
10 Ai3 = 1 if A is the base at position 13 of the sense strand, othaivise its value is 0; 

Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base 
is present or the senise strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
15 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 

Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
20 Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

C18 = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand^i&zO^j&JJS base pairs in length, its value is 0;. 

25 Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 

G2= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 

Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

Gio== 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 

Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
30 Gi9 = 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
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Ua = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 1 5 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 1 8 on the sense strand, otherwise its value is 0; 



GC15-19 = the number of G and C bases within positions 15 — 19 of the sense strand 
or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 

GQotai = the number of G and C bases in the sense strand; 

Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 
1 5 otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a 
row. 

3. A method of gene-silencing comprising selecting an siRNA according to 
20 claim 2 and introducing it into a cell. 

4. The metibiod according to claim 3 wherein said introducing is by allowing 
passive uptake of the siRNA? cnt-stsssss^, 

25 5. The method according to claim 3, wherein said introducing is through the 
use of a vector. 

6. A metfiod for developing an siRNA algorithm for selecting siRNA, said 
method comprising: 
30 (a) selecting a set of siRNA; 

(b) measuring the gene silencing ability of each siRNA from said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionaKty by the presence or 
absence of at least one variable selected from the group consisting of 
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the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at 
a particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 
(e) developing an algorithm using the information of step (d). 

A method of selecting an siRNA with improved functionality, said method 
comprising using the algorithm of claim 6. 

A method of selecting hyperfianctional siRNA, said method comprising 
using at least one fiinctional siRNA, wherein at least one said functional 
siRNA has been selected -according to the method of claim 7 and 
measuring the silencing ability of said at least one functional siRNA, 
wherein silencing ability is measured at a concentration of less than 1 
nanomolar siRNA. 

An siRNA molecule, wherein said siRNA molecule is effective at 
silencing Bcl-2. 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sequence substantially similar to a sequence selected from the group 
consisting of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAGc(&BQ..IDNO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
CAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

The siRNA molecule of claim 10, wherein said siRNA molecule 
comprises a sequence selected from the group consisting of 



GGGAGAUAGUGAUGAAGUA (SEQ. ID NO, 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

The siRNA molecule of claim 1 1 , wherein said siRNA molecule 
comprises GCAUGCGGCCUCUGUUUGA . 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sense strand and an anti-sense strand. 

The siRNA molecule of claim 9, wherien said siRNA molecule comprises 
a hairpin. 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
between 1 8 and 30 base pairs. 

A kit for gene silencing comprising at least one siRNA selected from the 
group consisting of sequences substantially similar to the group consisting 
of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
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GGAGAUAGUGAUGAAGUAC (SEQ, ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

1 7. A method of gene silencing comprising using the siRNA molecule of 
claim 10. 

1 8. A method of gene silencing comprising using the siRNA molecule of claim 
11. 

19. A kit, wherein said kit is comprised of at least two siRNA, wherein said at 
least two siRNA comprise a first optimized siRNA and a second optimized 
siRNA, wherein said first optimized siRNA and said second optimized 
siRNA are optimized according to one of the following formulas: 

Formula I = -(GC/3) +(AU,5.i9) -(Tm2o-c)*3 -(Gi3)*3 -(C,9) +(Ai9)*2 -KAj) 
+(Uio)+(A,4)-{U5)-<A„); 

Formula II = -(GOS) -(AU,5-i9)*3 -(G,3)*3 -(Cs) +(A,9)*2 +(A3); 
Formula III = -(0073) -KAU15-19) -(Tm2o-c)*3; 

Formula IV = -(GC/2)4<AU,5.i9)/2-(Tm2o»c)*2 -(Gi3)*3 -(0,9) -KAi9)*2 
+(A3) +(Uio)+(A,4)-(Us)-(Au); 

Formula V = -(G,3)*3 -(C19) +(A,9)*2 +(A3) + (U,o)+(A,4) -(U5) -(An); 
Formula VI = -(Gi3)*3 -(C,9) +(A,9)*2 +(A3); 

Formula VU = -{002) +(AU,5-i9)/2 -( Tm2(^)*l -(Gi3)*3 -(€,9) -KAi9)*3 
+(A3)*3 +(Uio)/2-KA,4)/2 -(U5)/2 -{An)/2; 



wherein in Foimidas I - VII: 



AUi5.i9 = 0-5 depending on the number of A or U bases on the sense 

strand at positions 15-19; 
Gi3= 1 if G is the base at position 13 on the sense strand, otherwise its 

value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 
value is 0; 

GC= the number of G and C bases in the entire sense strand; 
Tm 20-c = 1 if the Tm is greater than 20°C; 

A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

Ai i^-?l if A is the base at position 1 1 on the sense strand, otherwise its 
value is 0; 

Ai4= 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Ui'6= 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 

or, 



Formula VIII: (-14)*G,3-13*Ai-12*U7-ll*U2-10*A„-10.*U4-10*C3-10*C5-10*C6- 
9*A,o-9*U9-9*C,8-8*G,o-7*U,-7*U,6-7*C,7-7*C,9 
+7*U,7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +10*A7+10*Ui8+l 1 *A,9+ 
11*C9+15*G,+ 18*A3+19*Uio-Tm-3* (GC,ou.i)-6*(GC,5-i9)- 
30*X; and 

FonnulalX: (14.1)*A3-K14.9)*A6+(17.6)*A,3+(24.7)*A,9+(14.2)*Uio+(10.5)* 
C9+(23.9)*G,+(16.3)*G2+(-12.3)*Aii+(-19.3)*U,+(-12.1)*U2+ 
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(-1 1)*U3-K-1 5.2)*Ui5+(-l 1 .3)*Ui6 + (-1 1 .8)*C3+M7.4)*C6+(- 
I0.5)*C7+ (43.7)*Gi3+(-25.9)*Gir-Tm-3* (GCtotaO - 6*(GC,5-i9> 
30*X 

wherein 

5 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

As = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
0 A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7 = 1 if A is the base at position 7 of fee sense strand, otherwise its value is 0; 

Aio= 1 if A isfee base at position 10 of the sense'strand, otherwise its value is 0; 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
5 Ai9 = 1 if A is the base at position 1 9 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
0 C5 = 1 if C is the base at position 5 of the sense strand, oflierwise its value is 0; 

Q = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otiierwise^ its value is 0; 

C17 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
5 C18 = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
0 G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
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Gi9 = I if G is the base at position 1 9 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if tJis the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 == 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 1 5 -1 8 if the sense strand is only 18 base pairs in length; 

GQotai = the number of G and C bases in the sense strand; 

Tm = 1 00 if the targeting site contains an inverted repeat longer tiien 4 base pairs, 
otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a row. 
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Abstract 

Efficient sequence speciiSc gene silencing is possible through the use of 
siRNA technology. By selecting particular siRNAs by rationale design, one can 
maximize the generation of an effective gene silencing reagent, as well as methods for 
silencing genes. 
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Amendments to the Specification: 



Please replace Table III, spanning pages 57-63, with the following amended Table III: 



[[ TABLE III ]] 



TABLE in 


Cyclo 


I 


SEQ. ID 0032 


GUUCCAAAAACAGUGGAUA 


Cyclo 


2 


SEQ.ID0033 


UCCAAAAACAGUGGAUAAU 


Cyclo 


3 


SEQ. ID 0034 


CAAAAACAGUGGAUAAUUU 


Cyclo 


4 


SEQ. ID 0035 


AAAACAGUGGAUAAUUUUG 


Cyclo 


5 


SEQ. ID 0036 


AACAGUGGAUAAUUUUGUG 


Cyclo 


6 


SEQ. ID 0037 


CAGUGGAUAAUUUUGUGGC 


Cyclo 


7 


SEQ. ID 0038 


GUGGAUAAUUUUGUGGCCU 


Cyclo 


8 


SEQ. ID 0039 


GGAUAAUUUUGUGGCCUUA 


Cyclo 


9 


SEQ. ID 0040 


AUAAUUUUGUGGCCUUAGC 


Cyclo 


10 


SEQ. ID 0041 


AAUUUUGUGGCCUUAGCUA 


Cyclo 


11 


SEQ. ID 0042 


UUUUGUGGCCUUAGCUACA 


Cyclo 


12 


SEQ. ID 0043 


UUGUGGCCUUAGCUACAGG 


Cyclo 


13 


SEQ. ID 0044 


GUGGCCUUAGCUACAGGAG 


Cyclo 


1 A 

14 


SEQ. ID 0045 


GGCCUUAGCUACAGGAGAG 


Cyclo 


15 


SEQ. ID 0046 


CCUUAGCUACAGGAGAGAA 


Cyclo 


16 


SEQ. ID 0047 


UUAGCUACAGGAGAGAAAG 


Cyclo 


17 


SEQ. ID 0048 


AGCUACAGGAGAGAAAGGA 


Cyclo 


18 


SEQ. ID 0049 


CUACAGGAGAGAAAGGAUU 


Cyclo 


19 


SEQ. ID 0050 


ACAGGAGAGAAAGGAUUUG 


Cyclo 


20 


SEQ. ID 0051 


AGGAGAGAAAGGAUUUGGC 


Cyclo 


21 


SEQ. ID 0052 


GAGAGAAAGGAUUUGGCUA 


Cyclo 


22 


SEQ. ID 0053 


GAGAAAGGAUUUGGCUACA 


Cyclo 


23 


SEQ. ID 0054 


GAAAGGAUUUGGCUACAAA 


Cyclo 


24 


SEQ. ID 0055 


AAGGAUUUGGCUACAAAAA 


Cyclo 


25 


SEQ. ED 0056 


GGAUUUGGCUACAAAAACA 


Cyclo 


26 


SEQ. ID 0057 


AUUUGGCUACAAAAACAGC 


Cyclo 


27 


SEQ. ED 0058 


UUGGCUACAAAAACAGCAA 
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Cyclo 


28 


SEQ. ID 0059 


GGCUACAAAAACAGCAAAU 


Cyclo 


29 


SEQ, ID 0060 


CUACAAAAACAGCAAAUUC 


Cyclo 


30 


SEQ. ID 0061 


ACAAAAACAGCAAAUUCCA 


Cyclo 


31 


SEQ. ID 0062 


AAAAACAGCAAAUUCCAUC 


Cyclo 


32 


SEQ. ID 0063 


AAACAGCAAAUUCCAUCGU 


Cyclo 


33 


SEQ. ID 0064 


ACAGCAAAUUCCAUCGUGU 


Cyclo 


34 


SEQ. ID 0065 


AGCAAAUUCCAUCGUGUAA 


Cyclo 


35 


SEQ. ID 0066 


CAAAUUCCAUCGUGUAAUC 


Cyclo 


36 


SEQ. ID 0067 


AAUUCCAUCGUGUAAUCAA 


Cyclo 


37 


SEQ. ID 0068 


UUCCAUCGUGUAAUCAAGG 


Cyclo 


38 


SEQ. ID 0069 


CCAUCGUGUAAUCAAGGAC 


Cyclo 


39 


SEQ. ID 0070 


AUCGUGUAAUCAAGGACUU 


Cyclo 


40 


SEQ. ID 0071 


CGUGUAAUCAAGGACUUCA 


Cyclo 


41 


SEQ. ID 0072 


UGUAAUCAAGGACUUCAUG 


Cyclo 


42 


SEQ. ID 0073 


UAAUCAAGGACUUCAUGAU 


Cyclo 


43 


SEQ. ID 0074 


AUCAAGGACUUCAUGAUCC 


Cyclo 


44 


SEQ. ID 0075 


CAAGGACUUCAUGAUCCAG 


Cyclo 


45 


SEQ. ID 0076 


AGGACUUCAUGAUCCAGGG 


Cyclo 


46 


SEQ. ID 0077 


GACUUCAUGAUCCAGGGCG 


Cyclo 


47 


SEQ. ID 0078 


CUUCAUGAUCCAGGGCGGA 


Cyclo 


48 


SEQ. ID 0079 


UCAUGAUCCAGGGCGGAGA 


Cyclo 


49 


SEQ. ID 0080 


AUGAUCCAGGGCGGAGACU 


Cyclo 


50 


SEQ. ID 0081 


GAUCCAGGGCGGAGACUUC 


Cyclo 


51 


SEQ. ID 0082 


UCCAGGGCGGAGACUUCAC 


Cyclo 


52 


SEQ. ID 0083 


CAGGGCGGAGACUUCACCA 


Cyclo 


53 


SEQ. ID 0084 


GGGCGGAGACUUCACCAGG 


Cyclo 


54 


SEQ. ID 0085 


GCGGAGACUUCACCAGGGG 


Cyclo 


55 


SEQ. ID 0086 


GGAGACUUCACCAGGGGAG 


Cyclo 


56 


SEQ. ID 0087 


AGACUUCACCAGGGGAGAU 


Cyclo 


57 


SEQ. ID 0088 


ACUUCACCAGGGGAGAUGG 


Cyclo 


58 


SEQ. ID 0089 


UUCACCAGGGGAGAUGGCA 


Cyclo 


59 


SEQ. ID 0090 


CACCAGGGGAGAUGGCACA 


Cyclo 


SO 


SEQ. ID 0091 


CCAGGGGAGAUGGCACAGG 
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Cyclo 


61 


SEQ. ID 0092 


AGGGGAGAUGGCACAGGAG 


Cyclo 


62 


SEQ. ID 0093 


GGGAGAUGGCACAGGAGGA 


Cyclo 


63 


SEQ. ID [[0094]] 0431 


GAGAUGGCACAGGAGGAAA 


Cyclo 


64 


SEQ. ID 0095 


GAUGGCACAGGAGGAAAGA 


Cyclo 


65 


SEQ. ID 0094 


UGGCACAGGAGGAAAGAGC 


Cyclo 


66 


SEQ. ID 0096 


GCACAGGAGGAAAGAGCAU 


Cyclo 


67 


SEQ. ID 0097 


ACAGGAGGAAAGAGCAUCU 


Cyclo 


68 


SEQ. ID 0098 


AGGAGGAAAGAGCAUCUAC 


Cyclo 


69 


SEQ. ID 0099 


GAGGAAAGAGCAUCUACGG 


Cyclo 


70 


SEQ. ID 0100 


GGAAAGAGCAUCUACGGUG 


Cyclo 


71 


SEQ. ID 0101 


AAAGAGCAUCUACGGUGAG 


Cyclo 


72 


SEQ. ID 0102 


AGAGCAUCUACGGUGAGCG 


Cyclo 


73 


SEQ. ID 0103 


AGCAUCUACGGUGAGCGCU 


Cyclo 


74 


SEQ. ID 0104 


CAUCUACGGUGAGCGCUUC 


Cyclo 


75 


SEQ. ID 0105 


UCUACGGUGAGCGCUUCCC 


Cyclo 


76 


SEQ. ID 0106 


UACGGUGAGCGCUUCCCCG 


Cyclo 


77 


SEQ. ID 0107 


CGGUGAGCGCUUCCCCGAU 


Cyclo 


78 


SEQ. ID 0108 


GUGAGCGCUUCCCCGAUGA 


Cyclo 


79 


SEQ. ID 0109 


GAGCGCUUCCCCGAUGAGA 


Cyclo 


80 


SEQ. ID 0110 


GCGCUUCCCCGAUGAGAAC 


Cyclo 


81 


SEQ. ID 01 11 


GCUUCCCCGAUGAGAACUU 


Cyclo 


82 


SEQ. ID 01 12 


UUCCCCGAUGAGAACUUCA 


Cyclo 


83 


SEQ. ID 01 13 


CCCCGAUGAGAACUUCAAA 


Cyclo 


84 


SEQ. ID 01 14 


CCGAUGAGAACUUCAAACU 


Cyclo 


85 


SEQ. ID 01 15 


GAUGAGAACUUCAAACUGA 


Cyclo 


86 


SEQ. ID 01 16 


UGAGAACUUCAAACUGAAG 


Cyclo 


87 


SEQ. ID 01 17 


AGAACUUCAAACUGAAGCA 


Cyclo 


88 


SEQ. ID 0118 


AACUUCAAACUGAAGCACU 


Cyclo 


89 


SEQ. ID 01 19 


CUUCAAACUGAAGCACUAC 


Cyclo 


90 


SEQ. ID 0120 


UCAAACUGAAGCACUACGG 


DB 




SEQ. ID 0121 


ACGGGCAAGGCCAAGUGGG 


DB 


2 


SEQ. ED 0122 


CGGGCAAGGCCAAGUGGGA 


DB 


3 


SEQ. ID 0123 


GGGCAAGGCCAAGUGGGAU 
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DB 


4 


SEQ.ID0124 


GGCAAGGCCAAGUGGGAUG 


DB 


5 


SEQ.ID0125 


GCAAGGCCAAGUGGGAUGC 


DB 


6 


SEQ. ID 0126 


CAAGGCCAAGUGGGAUGCC 


DB 


7 


SEQ. ID 0127 


AAGGCCAAGUGGGAUGCCU 


DB 


8 


SEQ. ID 0128 


AGGCCAAGUGGGAUGCCUG 


DB 


9 


SEQ. ID 0129 


GGCCAAGUGGGAUGCCUGG 


DB 


10 


SEQ. ID 0130 


GCCAAGUGGGAUGCCUGGA 


DB 


11 


SEQ. ID 0131 


CCAAGUGGGAUGCCUGGAA 


DB 


12 


SEQ. ID 0132 


CAAGUGGGAUGCCUGGAAU 


DB 


13 


SEQ. ID 0133 


AAGUGGGAUGCCUGGAAUG 


DB 


14 


SEQ. ID 0134 


AGUGGGAUGCCUGGAAUGA 


DB 


15 


SEQ. ID 0135 


GUGGGAUGCCUGGAAUGAG 


DB 


16 


SEQ. ID 0136 


UGGGAUGCCUGGAAUGAGC 


DB 


17 


SEQ. ID 0137 


GGGAUGCCUGGAAUGAGCU 


DB 


18 


SEQ. ID 0138 


GGAUGCCUGGAAUGAGCUG 


DB 


19 


SEQ. ID 0139 


GAUGCCUGGAAUGAGCUGA 


DB 


20 


SEQ. ID 0140 


AUGCCUGGAAUGAGCUGAA 


DB 


21 


SEQ. ID 0141 


UGCCUGGAAUGAGCUGAAA 


DB 


22 


SEQ. ID 0142 


GCCUGGAAUGAGCUGAAAG 


DB 


23 


SEQ. ID 0143 


CCUGGAAUGAGCUGAAAGG 


DB 


24 


SEQ. ID 0144 


CUGGAAUGAGCUGAAAGGG 


DB 


25 


SEQ. ID 0145 


UGGAAUGAGCUGAAAGGGA 


DB 


26 


SEQ. ID 0146 


GGAAUGAGCUGAAAGGGAC 


DB 


27 


SEQ. ID 0147 


GAAUGAGCUGAAAGGGACU 


DB 


28 


SEQ. ID 0148 


AAUGAGCUGAAAGGGACUU 


DB 


29 


SEQ. ID 0149 


AUGAGCUGAAAGGGACUUC 


DB 


30 


SEQ. ID 0150 


UGAGCUGAAAGGGACUUCC 


DB 


31 


SEQ. ID 0151 


GAGCUGAAAGGGACUUCCA 


DB 


32 


SEQ. ID 0152 


AGCUGAAAGGGACtTLTCCAA 


DB 


33 


SEQ. ID 0153 


GCUGAAAGGGACUUCCAAG 


DB 


34 


SEQ. ID 0154 


CUGAAAGGGACUUCCAAGG 


DB 


35 


SEQ. ID 0155 


UGAAAGGGACUUCCAAGGA 


DB 


36 


SEQ. ID 0156 


GAAAGGGACUUCCAAGGAA 
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DB 


37 


SEQ. ID0157 


AAAGGGACUUCCAAGGAAG 


DB 


38 


SEQ. ID 0158 


AAGGGACUUCCAAGGAAGA 


DB 


39 


SEQ. ID 0159 


AGGGACUUCCAAGGAAGAU 


DB 


40 


SEQ. ID 0160 


GGGACUUCCAAGGAAGAUG 


DB 


41 


SEQ. ID 0161 


GGACUUCCAAGGAAGAUGC 


DB 


42 


SEQ. ID 0162 


GACUUCCAAGGAAGAUGCC 


DB 


43 


SEQ. ID 0163 


ACUUCCAAGGAAGAUGCCA 


DB 


44 


SEQ. ID 0164 


CUUCCAAGGAAGAUGCCAU 


DB 


45 


SEQ. ID 0165 


UUCCAAGGAAGAUGCCAUG 


DB 


46 


SEQ. ID 0166 


UCCAAGGAAGAUGCCAUGA 


DB 


47 


SEQ. ID 0167 


CCAAGGAAGAUGCCAUGAA 


DB 


48 


SEQ. ID 0168 


CAAGGAAGAUGCCAUGAAA 


DB 


49 


SEQ. ID 0169 


AAGGAAGAUGCCAUGAAAG 


DB 


50 


SEQ. ED 0170 


AGGAAGAUGCCAUGAAAGC 


DB 


51 


SEQ. ID 0171 


GGAAGAUGCCAUGAAAGCU 


DB 


52 


SEQ. ID 0172 


GAAGAUGCCAUGAAAGCUU 


DB 


53 


SEQ. ID 0173 


AAGAUGCCAUGAAAGCUUA 


DB 


54 


SEQ. ID 0174 


AGAUGCCAUGAAAGCUUAC 


DB 


55 


SEQ. ID 0175 


GAUGCCAUGAAAGCUUACA 


DB 


56 


SEQ. ID 0176 


AUGCCAUGAAAGCUUACAU 


DB 


57 


SEQ. ID 0177 


UGCCAUGAAAGCUUACAUC 


DB 


58 


SEQ. ID 0178 


GCCAUGAAAGCUUACAUCA 




59 


SEQ. ID 0179 


CCAUGAAAGCUUACAUCAA 


DB 


60 


SEQ. ID 0180 


CAUGAAAGCUUACAUCAAC 


DB 


61 


SEQ. ID 0181 


AUGAAAGCUUACAUCAACA 


DB 


62 


SEQ. ID 0182 


UGAAAGCUUACAUCAACAA 


r\T> 

DB 


63 


SEQ. ID 0183 


GAAAGCUUACAUCAACAAA 


DB 


54 


SEQ. ID 0184 


AAAGCUUACAUCAACAAAG 


DB 


55 


SEQ. ID 0185 


AAGCUUACAUCAACAAAGU 


DB 


56 


SEQ. ID 0186 


AGCUUACAUCAACAAAGUA 


DB 


57 ! 


SEQ. ID 0187 


3CUUACAUCAACAAAGUAG 


DB 


58 i 


SEQ. ID 0188 


::UUACAUCAACAAAGUAGA 


DB ( 


59 i 


SEQ. ID 0189 


LJUACAUCAACAAAGUAGAA 
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DB 


70 


SEQ.IDpl90 


UACAUCAACAAAGUAGAAG 


DB 


71 


SEQ.ID0191 


ACAUCAACAAAGUAGAAGA 


DB 


72 


SEQ. ID 0192 


CAUCAACAAAGUAGAAGAG 


DB 


73 


SEQ. ID 0193 


AUCAACAAAGUAGAAGAGC 


DB 


74 


SEQ. ID 0194 


UCAACAAAGUAGAAGAGCU 


DB 


75 


SEQ. ID 0195 


CAACAAAGUAGAAGAGCUA 


DB 


76 


SEQ. ID 0196 


AACAAAGUAGAAGAGCUAA 


DB 


77 


SEQ. ID 0197 


ACAAAGUAGAAGAGCUAAA 


DB 


78 


SEQ. ID 0198 


CAAAGUAGAAGAGCUAAAG 


DB 


79 


SEQ. ID 0199 


AAAGUAGAAGAGCUAAAGA 


DB 


80 


SEQ. ID 0200 


AAGUAGAAGAGCUAAAGAA 


DB 


81 


SEQ. ID 0201 


AGUAGAAGAGCUAAAGAAA 


DB 


82 


SEQ. ID 0202 


GUAGAAGAGCUAAAGAAAA 


DB 


83 


SEQ. ED 0203 


UAGAAGAGCUAAAGAAAAA 


DB 


84 


SEQ. ID 0204 


AGAAGAGCUAAAGAAAAAA 


DB 


85 


SEQ. ID 0205 


GAAGAGCUAAAGAAAAAAU 


DB 


86 


SEQ. ID 0206 


AAGAGCUAAAGAAAAAAUA 


DB 


87 


SEQ. ID 0207 


AGAGCUAAAGAAAAAAUAC 


DB 


88 


SEQ. ID 0208 


GAGCUAAAGAAAAAAUACG 


DB 


89 


SEQ. ID 0209 


AGCUAAAGAAAAAAUACGG 


DB 


90 


SEQ. ID 0210 


GCUAAAGAAAAAAUACGGG 


Luc 


1 


SEQ. ID 0211 


AUCCUCAUAAAGGCCAAGA 


Luc 


2 


SEQ. ID 0212 


AGAUCCUCAUAAAGGCCAA 


Luc 


3 


SEQ. ID 0213 


AGAGAUCCUCAUAAAGGCC 


Luc 


4 


SEQ. ID 0214 


AGAGAGAUCCUCAUAAAGG 


Luc 


5 


SEQ. ID 0215 


UCAGAGAGAUCCUCAUAAA 


r 

Luc 


6 


SEQ. ID 0216 


AAUCAGAGAGAUCCUCAUA 


Luc 


7 


SEQ. ID 0217 


AAAAUCAGAGAGAUCCUCA 


Luc 


B 


SEQ. ID 0218 


GAAAAAUCAGAGAGAUCCU 


Luc 


? 


SEQ. ID 0219 


AAGAAAAAUCAGAGAGAUC 


Luc 


10 


SEQ. ID 0220 


GCAAGAAAAAUCAGAGAGA 


Luc 


11 


SEQ. ID 0221 


ACGCAAGAAAAAUCAGAGA 


Luc 


12 


SEQ. ID 0222 


CGACGCAAGAAAAAUCAGA 
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Luc 


13 


SEQ. ID 0223 


CUCGACGCAAGAAAAAUCA 


Luc 


14 


SEQ. ID 0224 


AACUCGACGCAAGAAAAAU 


Luc 


15 


SEQ. ID 0225 


AAAACUCGACGCAAGAAAA 


Luc 


16 


SEQ. ID 0226 


GGAAAACUCGACGCAAGAA 


Luc 


17 


SEQ. ID 0227 


CCGGAAAACUCGACGCAAG 


Luc 


18 


SEQ. ID 0228 


UACCGGAAAACUCGACGCA 


Luc 


19 


SEQ. ID 0229 


CUUACCGGAAAACUCGACG 


Luc 


20 


SEQ. ID 0230 


GUCUUACCGGAAAACUCGA 


Luc 


21 


SEQ. ID 0231 


AGGUCUUACCGGAAAACUC 


Luc 


22 


SEQ. ID 0232 


AAAGGUCUUACCGGAAAAC 


Luc 


23 


SEQ. ID 0233 


CGAAAGGUCUUACCGGAAA 


Luc 


24 


SEQ. ID 0234 


ACCGAAAGGUCUUACCGGA 


Luc 


25 


SEQ. ID 0235 


GUACCGAAAGGUCUUACCG 


Luc 


26 


SEQ. ID 0236 


AAGUACCGAAAGGUCUUAC 


Luc 


27 


SEQ. ID 0237 


CGAAGUACCGAAAGGUCUU 


Luc 


28 


SEQ. ID 0238 


GACGAAGUACCGAAAGGUC 


Luc 


29 


SEQ. ID 0239 


UGGACGAAGUACCGAAAGG 


Luc 


30 


SEQ. ED 0240 


UGUGGACGAAGUACCGAAA 


Luc 


31 


SEQ. ID 0241 


UUUGUGGACGAAGUACCGA 


Luc 


32 


SEQ. ID 0242 


UGUUUGUGGACGAAGUACC 


Luc 


33 


SEQ. ID 0243 


UGUGUUUGUGGACGAAGUA 


Luc 


34 


SEQ. ID 0244 


GUUGUGUUUGUGGACGAAG 


Luc 


35 


SEQ. ID 0245 


GAGUUGUGUUUGUGGACGA 


Luc 


36 


SEQ. ID 0246 


AGGAGUUGUGUUUGUGGAC 


Luc 


37 


SEQ. ID 0247 


GGAGGAGUUGUGUUUGUGG 


Laic 


38 


SEQ. ID 0248 


GCGGAGGAGUUGUGUUUGU 


Luc 


39 


SEQ. ID 0249 


GCGCGGAGGAGUUGUGUUU 


Luc 


40 


SEQ. ID 0250 


(JUGCGCGGAGGAGUUGUGU 


Luc 


41 


SEQ. ID 0251 


AGUUGCGCGGAGGAGUUGU 


Luc 




SEQ. ID 0252 


AAAGUUGCGCGGAGGAGUU 


Luc 


« 


SEQ. ID 0253 


AAAAAGUUGCGCGGAGGAG 


Luc 


44 


SEQ. ID 0254 


CGAAAAAGUUGCGCGGAGG 


Luc 


is 


SEQ. ID 0255 


CGCGAAAAAGUUGCGCGGA 
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Luc 


46 


SEQ. ED 0256 


ACCGCGAAAAAGUUGCGCG 


Luc 


47 


SEQ.ID0257 


CAACCGCGAAAAAGUUGCG 


Luc 


48 


SEQ. ID 0258 


AACAACCGCGAAAAAGUUG 


Luc 


49 


SEQ. ID 0259 


GUAACAACCGCGAAAAAGU 


Luc 


50 


SEQ. ID 0260 


AAGUAACAACCGCGAAAAA 


Luc 


SI 


SEQ. ID 0261 


UCAAGUAACAACCGCGAAA 


Luc 


52 


SEQ. ID 0262 


AGUCAAGUAACAACCGCGA 


Luc 


53 


SEQ. ID 0263 


CCAGUCAAGUAACAACCGC 


Luc 


54 


SEQ. ID 0264 


CGCCAGUCAAGUAACAACC 


Luc 


55 


SEQ. ID 0265 


GUCGCCAGUCAAGUAACAA 


Luc 


56 


SEQ. ID 0266 


ACGUCGCCAGUCAAGUAAC 


Luc 


57 


SEQ. ID 0267 


UUACGUCGCCAGUCAAGUA 


Luc 


58 


SEQ. ID 0268 


GAUUACGUCGCCAGUCAAG 


Luc 


59 


SEQ. ID 0269 


UGGAUUACGUCGCCAGUCA 


Luc 


60 


SEQ. ID 0270 


CGUGGAUUACGUCGCCAGU 


Luc 


61 


SEQ. ID 0271 


AUCGUGGAUUACGUCGCCA 


Luc 


62 


SEQ. ID 0272 


AGAUCGUGGAUUACGUCGC 


Luc 


63 


SEQ. ED 0273 


AGAGAUCGUGGAUUACGUC 


Luc 


64 


SEQ. ED 0274 


AAAGAGAUCGUGGAUUACG 


Luc 


65 


SEQ. ID 0275 


AAAAAGAGAUCGUGGAUUA 


Luc 


66 


SEQ. ID 0276 


GGAAAAAGAGAUCGUGGAU 


Luc 


67 


SEQ. ID 0277 


ACGGAAAAAGAGAUCGUGG 


Luc 


68 


SEQ. ID 0278 


UGACGGAAAAAGAGAUCGU 


Luc 


69 


SEQ. ID 0279 


GAUGACGGAAAAAGAGAUC 


Luc 


70 


SEQ. ID 0280 


ACGAUGACGGAAAAAGAGA 


Luc 


71 


SEQ. ID 0281 


AGACGAUGACGGAAAAAGA 


Luc 


72 


SEQ. ID 0282 


AAAGACGAUGACGGAAAAA 


Luc 


73 


SEQ. ID 0283 


GGAAAGACGAUGACGGAAA 


Luc 


74 


SEQ. ID 0284 


ACGGAAAGACGAUGACGGA 


Luc 


75 


5EQ. ID 0285 


GCACGGAAAGACGAUGACG 


Luc 


76 


SEQ. ID 0286 


3AGCACGGAAAGACGAUGA 


Luc 


77 I 


SEQ. ID 0287 


UGGAGCACGGAAAGACGAU 


Luc 


78 


SEQ. ID 0288 


UUUGGAGCACGGAAAGACG 
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Luc 


79 


SEQ. ID 0289 


GUUUUGGAGCACGGAAAGA 


Luc 


80 


SEQ. ID 0290 


UUGUUUUGGAGCACGGAAA 


Luc 


81 


SEQ. ID 0291 


UGUUGUUUUGGAGCACGGA 


Luc ' 


82 


SEQ. ID 0292 


GUUGUUGUUUUGGAGCACG 


Luc 


83 


SEQ. ID 0293 


CCGUUGUUGUUUUGGAGCA 


Luc 


84 


SEQ. ID 0294 


CGCCGUUGUUGUUUUGGAG 


Luc 


85 


SEQ. ID 0295 


GCCGCCGUUGUUGUUUUGG 


Luc 


86 


SEQ. ID 0296 


CCGCCGCCGUUGUUGUUUU 


Luc 


87 


SEQ. ID 0297 


UCCCGCCGCCGUUGUUGUU 


Luc 


88 


SEQ. ID 0298 


cuucccgccgcx:guuguug 


Luc 


89 


SEQ. ID 0299 


AACUiJCCCGCCGCCGUUGU 


Luc 


90 


SEQ. ID 0300 


UGAACUUCCCGCCGCCGUU 



Please replace Table V, spanning pages 57-63, with the following amended Table V: 



[[ TABLE V ]] 



TABLE V 


Gene 
Name 


Accession 
Number 


SEQ. ID NO. FTllSeqTence 


Formula 

vin 


Formula 
DC 


CLTC 


^^M 004859 


SEQ. ID NO. [[0301]] 

2400 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


CLTC 


NM 004859 


SEQ. ID NO. [[0302]] 
2401 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


NM 004859 


SEQ. ID NO. [[0303]] 
2402 


rGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM 004859 


SEQ. ID NO. [[0304]] 

2403 


GGAAATGGATCTCTTTGAA 


54 


49.4 


CLTA 


NM_001833 


SEQ. ID NO. [[0305]] 
2404 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NM 001833 


SEQ. ED NO. [[0306]] 
2405 


AGACAGTTATGCAGCTATT 


4 


22.9 
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CLTA 


NM_001833 


SEQ.ro NO. [[0307]] 
2406 


CCAATTCTCGGAAGCAAGA 


1 


17 


CLTA 


NM_001833 


SEQ.ro NO. [[0308]] 
2407 


GAAAGTAATGGTCCAACAG 


-I 


-13 


CLTB 


NM_001834 


SEQ.ro NO. [[0309]] 
2408 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_001834 


SEQ.ro NO. [[0310]] 
2409 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NM_001834 


SEQ.ro NO. 03 11 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_00I834 


SEQ.ro NO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ.ro NO. 0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ.ro NO. 0314 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ.ro NO. 0315 


GGAAATGGAACCACTAAGA 


30 


61.2 


CALM 


U45976 


SEQ.ro NO. 0316 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS15 


NM_001981 


SEQ.ro NO. 0317 


CCACCAAGATTTCATGATA 


48 


25.2 


EPS 15 


NM_001981 


SEQ.ro NO. 0318 


GATCGGAACTCCAACAAGA 


43 


49.3 


EPS 15 


NM_001981 


SEQ.ro NO. 0319 


AAACGGAGCTACAGATTAT 


39 


11.5 


EPS 15 


NM_001981 


SEQ. ro NO. 0320 


CCACACAGCATTCTTGTAA 


33 


-23.6 


EPS15R 


NM_021235 


SEQ.ro NO. 0321 


GAAGTTACCTTGAGCAATC 


48 


33 


EPS15R 


NM_021235 


SEQ.ro NO. 0322 


GGACTTGGCCGATCCAGAA 


27 


33 


EPS15R 


NM_021235 


SEQ. ro NO. 0323 


GCACTTGGATCGAGATGAG 


20 


1.3 


EPS15R 


NM_021235 


SEQ. ro NO. 0324 


CAAAGACCAATTCGCGTTA 


17 


27.7 


DNM2 


NM_004945 


SEQ. ro NO. 0325 


CCGAATCAATCGCATCTTC 


6 


-29.6 


DNM2 


NM_004945 


SEQ. ID NO. 0326 


GACATGATCCTGCAGTTCA 


5 


-14 


DNM2 


NM_004945 


SEQ. ro NO. 0327 


GAGCGAATCGTCACCACTT 


5 


24 


DNM2 


NM_004945 


SEQ. ID NO. 0328 


CCTCCGAGCTGGCGTCTAC 


-4 


-63.6 


ARF6 


AF93885 


SEQ. ro NO. 0329 


rCACATGGTTAACCTCTAA 


27 


-21.1 


ARF6 


AF93885 


SEQ. ID NO. 0330 


GATGAGGGACGCCATAATC 


7 


-38.4 


ARF6 


AF93885 


SEQ.ro NO. 0331 


CCTCTAACTACAAATCTTA 


4 


16.9 


ARF6 


AF93885 


SEQ. ro NO. 0332 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


GCAAGCAAGTCCTAACATT 


40 


25.1 


RAB5A 


BC001267 


SEQ. ro NO. 0334 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


RAB5A 


BC001267 


SEQ. ro NO. 0335 


AGGAATCAGTGTTGTAGTA 


16 


U.5 


RAB5A 


BC001267 


SEQ.ro NO. 0336 


GAAGAGGAGTAGACCTTAC 


12 


7 
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EIAB5B 


NM_002868 


SEQ. ID NO, 0337 


GAAAGTCAAGCCTGGTATT 


1^ 


\ 18.1 


RAB5B 


NM002868 


SEQ. ID NO. 0338 


AAAGTCAAGCCTGGTATTA 


f 


) -17.8 


RAB5B 


NM_002868 


SEQ. ID NO. 0339 


GCTATGAACGTGAATGATC 


3 


1 -21.1 


EIAB5B 


NM_002868 


SEQ. ID NO. 0340 


CAAGCCTGGTATTACGTTT 


-■) 


' -37.5 


RAB5C 


AF141304 


SEQ. ID NO. 0341 


GGAACAAGATCTGTCAATT 


3S 


51.9 


RAB5C 


AF141304 


SEQ. ID NO. 0342 


GCAATGAACGTGAACGAAA 


29 


43.7 


RAB5C 


AF141304 


SEQ. ID NO- 0343 


CAATGAACGTGAACGAAAT 


18 


43.3 


EIAB5C 


AF141304 


SEQ. .ID NO. 0344 


GGACAGGAGCGGTATCACA 


6 


18.2 


EEAl 


XM_018197 


SEQ. ID NO. 0345 


AGACAGAGCTTGAGAATAA 


67 


64.1 


EEAl 


XM_018197 


SEQ. ID NO. 0346 


GAGAAGATCTTTATGCAAA 


60 


48.7 


EEAl 


XM_018197 


SEQ. ID NO. 0347 


GAAGAGAAATCAGCAGATA 


58 


45.7 


EEAl 


XM:^018197 


SEQ. ID NO. 0348 


GCAAGTAACTCAACTAACA 


56 


72.3 


AP2B1 


NM_001282 


SEQ. ID NO. 0349 


GAGCTAATCTGCCACATTG 


49 


-12.4 


AP2B1 


NM_001282 


SEQ. ID NO. 0350 


GCAGATGAGTTACTAGAAA 


44 


48.9 


AP2B1 


NM_00I282 


SEQ. ID NO. 0351 


CAACTTAATTGTCCAGAAA 


41 


28.2 


AP2B1 


NM_001282 


SEQ. ID NO. 0352 


CAACACAGGATTCTGATAA 


33 


-5.8 


PLK 


NM_00503O 


SEQ. ID NO. 0353 


AGATTGTGCCTAAGTCTCT 


-35 


-3.4 


PLK 


NM_005030 


SEQ. ID NO. 0354 


ATGAAGATCTGGAGGTGAA 


0 


-4.3 


PLK 


NM_005030 


SEQ. ID NO. 0355 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


PLK 


NM_O0503O 


SEQ. ID NO. 0356 


AGATCACCCTCCTTAAATA 


15 


72.3 


CrAPDH 


NM_002046 


SEQ. ID NO. 0357 


CAACGGATTTGGTCGTATT 


27 


-2.8 


CjAtDH 


!SIM_002046 


SEQ. ID NO. 0358 


GAAATCCCATCACCATCTT 


24 


3.9 


uAruri 


NM_002046 


SEQ. ID NO. 0359 


gacctcaactacatggttt 


22 


-22.9 


vjrArUrl 


NM002046 


SEQ. ID NO, 0360 


rGGTTTACATGTTCCAATA 


9 


9.8 


c-Myc 




SEQ. ID NO. 0361 


GAAGAAATCGATGTTGTTT 


31 


-11.7 


c-Myc 




SEQ. ID NO. 0362 


A.CACAAACTTGAACAGCTA 


22 


51.3 


c-Myc 




^ED ID NO VfKf:^ 




Is 


26 


c-Myc 


■ 


SEQ. ID NO. 0364 


3AAACGACGAGAACAGTTG 


18 


-8.9 


MAP2K1 


snV[_002755 ! 


5EQ. ID NO. 0365 


jCACATGGATGGAGGTTCT 


26 


16 


MAP2K1 1 


^M_002755 ! 


5EQ. ID NO. 0366 


jCAGAGAGAGCAGATTTGA 


16 


0.4 


MAP2K1 I 


^^002755 \ 


SEQ. ID NO. 0367 ( 


3AGGTTCTCTGGATCAAGT 


14 


15.5 


MAP2K1 I 


^JM_^002755 5 


5EQ.IDNO.0368 ( 


jAGCAGATTTGAAGCAACT 


14 


18.5 


MAP2K2 1 


^IM_030662 5 


5EQ.IDNO.0369 ( 


:AAAGACGATGACXrCGAA 


37 


26.4 
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MAP2K2 


NM_^030662 


SEQ. ID NO. 0370 


GATCAGCATTTGCATGGAA 


2^ 


-0.7 


MAP2K2 


NM_030662 


SEQ. ID NO. 0371 


rCCAGGAGTTTGTCAATAA 


r 


J -4.5 


MAP2K2 


NM__030662 


SEQ. ID NO. 0372 


GGAAGCTGATCCACCTTGA 


le 


» 59.2 


ECNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0373 


GCAGAAATCTAAGGATATA 


53 


35.8 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0374 


CAACAAGGATGAAGTCTAT 


5C 


18.3 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0375 


CAGCAGAAATCTAAGGATA 


41 


32.7 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0376 


CTAGATGGCTTTCTCAGTA 


39 


3.9 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0377 


AGACAAGGTCCCAAAGACA 


-16 


58.1 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0378 


GGAATGGCAAGACCAGCAA 


-6 


36 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0379 


AGAATTATTCCAGGGnTA 


-3 


16.1 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0380 


GCAGACAAGGTCCCAAAGA 




8.9 


LAMIN A/C 


NM__170707 


SEQ, ID NO. 0381 


AGAAGCAGCTTCAGGATGA 


31 


38.8 


LAMIN A/C 


NM_1 70707 


SEQ. ID NO. 0382 


GAGCTTGACTTCCAGAAGA 


33 


22.4 


LAMIN A/C 


NM_1 70707 


SEQ. ID NO. 0383 


CCACCGAAGTTCACCCTAA 


21 


27.5 


LAMIN A/C 


NM_^1 70707 


SEQ. ID NO. 0384 


GAGAAGAGCTCX^TCCATCA 


55 


30.1 


CyclophilinB 


M60857 


SEQ. ID NO. 0385 


GAAAGAGCATCTACGGTGA 


41 


83.9 


CyclophilinB 


M60857 


SEQ. ID NO. 0386 


GAAAGGATTTGGCTACAAA 


53 


59.1 


CyclophilinB 


M60857 


SEQ. ID NO. 0387 


ACAGCAAATTCCATCGTGT 


-20 


28.8 


CyclophilinB 


M60857 


f 

SEQ. ID NO. 0388 


GGAAAGACTGTTCCAAAAA 


2 


27 


DBIl 


NM_020548 


SEQ. ID NO. 0389 


CAACACGCCTCATCCTCTA 


27 


-7.6 


DBI2 


NM_^020548 


SEQ. ID NO. 0390 


CATGAAAGCTTACATCAAC 


25 


-30.8 


DBI3 


NM_020548 


SEQ. ID NO. 0391 


AAGATGCCATGAAAGCTTA 


17 


22 


DBI4 


ISIM_020548 


SEQ. ID NO. 0392 


GCACATACCGCCTGAGTCT 


15 


3.9 


rLUCl 




SEQ. ID NO. 0393 


GATCAAATCTGAAGAAGGA 


57 


49.2 


rLUC2 




SEQ. ID NO. 0394 


aCCAAGAAGTTTCCTAATA 


50 


13.7 


rLUC3 




SEQ. ID NO. 0395 


CAGCATATCTTGAACCATT 


41 


-2.2: 


rl T ICd 




stt^. lu INU. Oiyo 


A A AAA AAA ^^^^^^ A rry^^ A 

jAACAAAGGAAACGGATGA 


39 


29.2 


SeAPl ! 


^_031313 ! 


SEQ. ID NO. 0397 


"GGAAACGGTCCAGGCTAT 


6 


26.9 


SeAP2 I 


^^031313 J 


5EQ. ID NO. 0398 


3CTTCGAGCAGACATGATA 


4 


-11.2 


SeAP3 I 


vfM_031313 i 


5EQ. ID NO. 0399 ( 


:CTACACGGTCCTCCTATA 


4 


4.9 


SeAP4 I 


^_031313 J 


JEQ. IDNO.0400 ( 


jCCAAGAACCTCATCATCT 


1 


-9.9 


fLUCl 




>EQ.IDNO.0401 ( 


3ATATGGGCTGAATACAAA 


54 


40.4 


fLUC2 




SEQ. ID NO. 0402 ( 


jCACTCTGATTGACAAATA 


47 


54.7 
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fLUC3 




SEQ.IDNO. 0403 


rGAAGTCTCTGATTAAGTA 


46 


34.5 


fLUC4 




SEQ.IDNO.0404 


TCAGAGAGATCCTCATAAA 


40 


11.4 


mCyclo_l 


NM_008907 


SEQ.IDNO. 0405 


GCAAGAAGATCACCATTTC 


52 


46.4 


mCycIo_2 


NM_008907 


SEQ.IDNO. 0406 


GAGAGAAATTTGAGGATGA 


36 


70.7 


mCycloS 


NM_008907 


SEQ.IDNO. 0407 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


iiiCyclo_4 


NM_008907 


SEQ.IDNO. 0408 


GAAAGAAGGCATGAACATT 


27 


10.3 


BCL2_1 


NM_000633 


SEQ.IDNO. 0409 


GGGAGATAGTGATGAAGTA 


21 


72 


BCL2_2 


NM_000633 


SEQ.IDNO. 0410 


GAAGTACATCCATTATAAG 


1 


3.3 


BCL2_3 


NM_000633 


SEQ.IDNO. 0411 


GTACGACAACCGGGAGATA 


1 


35.9 


BCL2_4 


NM_000633 


SEQ. IDNO.0412 


AGATAGTGATGAAGTACAT 


-12 


22.1 


BCL2_5 


NM_000633 


SEQ. ID NO. 0413 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


BCL2_6 


NM_000633 


SEQ.IDNO. 0414 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


QBl 


NM_003365.1 


SEQ.IDNO. 0415 


GCACACAGCUUACUACAUC 


52 


-4.8 


QB2 


NM_003365.1 


SEQ. ID NO. 0416 


GAAAUGCCCUGGUAUCUCA 


49 


22.1 


QB3 


NM_003365.1 


SEQ. ID NO. 0417 


GAAGGAACGUGAUGUGAUC 


34 


22.9 


QB4 


NM_003365.1 


SEQ. BONO. 0418 


GCACUACUCCUGUGUGUGA 


28 


20.4 


ATEl-1 


NM_00704I 


SEQ. ID NO. 0419 


gaacccagcuggagaacuu 


45 


15.5 


ATEl-2 


NM_007041 


SEQ. ID NO. 0420 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


ATEl-3 


NM_007041 


SEQ. ID NO. 0421 


GUACUACGAUCCUGAUUAU 


37 


32.9 


ATEl-4 


NM_007041 


SEQ. ID NO. 0422 


GUGCCGACCUUUACAAUUU 


35 


18.2 


EGFR-1 


NM_005228 


SEQ. ID NO. 0423 


GAAGGAAACTGAATTCAAA 


68 


79.4 


EGFR-1 


NM_005228 


SEQ.IDNO. 0424 


GGAAATATGTACTACGAAA 


49 


49.5 


EGFR-1 


NM_005228 


SEQ.IDNO. 0425 


CCACAAAGCAGTGAATTTA 


41 


7.6 


EGFR-1 


NM_005228 


SEQ. ID NO. 0426 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Please replace Table IX, spanning pages 124-129, with the following amended Table IX: 



[[Table IX]] 



TABLE IX 


Gene 
Name 


Accession # 


Gl# 


Duplex # 


Sequence 


SEQ. ID 
NO. 


AR 


NM 000044 


21322251 


D-003400-01 


GGAACTCGATCGTATCATT 


1369 


AR 


NM 000044 


21322251 


D-003400-02 


CAAGGGAGGTTACACCAAA 


1370 


AR 


NM 000044 


21322251 


D-003400-03 


TCAAGGAACTCGATCGTAT 


1371 
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AR 


NM 000044 


21322251 


D-003400-04 


GAAATGATTGCACTATTGA 


1372 














ESR1 


NM 000125 


4503602 


D-003401r01 


GAATGTGCCTGGCTAGAGA 


1373 


ESR1 


NM 000125 


4503602 


D-003401-02 


CATGAGAGCTGCCAACCTT 


1374 


ESR1 


NM 000125 


4503602 


D-003401-03 


AGAGAAAGATTGGCCAGTA 


1375 


ESR1 


NM 000125 


4503602 


D-003401-04 


CAAGGAGACTCGCTACTGT 


1376 














ESR2 


NM 001437 


10835012 


D-003402-01 


GAACATCTGCTCAACATGA 


1377 


ESR2 


NM 001437 


10835012 


D-003402-02 


GCACGGCTCCATATACATA 


1378 


ESR2 


NM 001437 


10835012 


D-003402-03 


CAAGAAGATTCCCGGCTTT 


1379 


ESR2 


NM 001437 


10835012 


D-003402-04 


GGAAATGCGTAGAAGGAAT 


1380 














ESRRA 


NM 004451 


18860919 


D-003403-01 


GGCCTTCGCTGAGGACTTA 


1381 


ESRRA 


NM 004451 


18860919 


D-003403-02 


TGAATGCACTGGTGTCTCA 


1382 


ESRRA 


NM 004451 


18860919 


D-003403-03 


GCATTGAGCCTCTCTACAT 


1383 


ESRRA 


NM 004451 


18860919 


D-003403-04 


CCAGACAGCGGGCAAAGTG 


1384 














ESRRB 


NM 004452 


22035686 


D-003404-01 


TACCTGAGCI lACAAAl 1 1 


1385 


ESRRB 


NM 004452 


22035686 


D-003404-02 


GCACTTCTATAGCGTCAAA 


1386 


ESRRB 


NM_004452 


22035686 


D-003404-03 


CAACTCCGATTCCATGTAC 


1387 


ESRRB 


NM 004452 


22035686 


D-003404-04 


GGACTCGCCACCCATGTTT 


1388 














ESRRG 


NM 001438 


4503604 


D-003405-01 


AAACAAAGATCGACACATT 


1389 


ESRRG 


NM 001438 


4503604 


D-003405-02 


TCAGGAAACTGTATGATGA 


1390 


ESRRG 


NM 001438 


4503604 


D-003405-03 


GAAGACCAGTCCAAATTAG 


1391 


ESRRG 


NM 001438 


4503604 


D-003405-04 


ATGAAGCGCTGCAGGATTA 


1392 














HNF4A 


NM 000457 


21361184 


D-003406-01 


CGACATCACTGGAGCATAT 


1393 


HNF4A 


NM 000457 


21361184 


D-003406-02 


GAAGGAAGCCGTCCAGAAT 


1394 


HNF4A 


NM 000457 


21361184 


D-003406-03 


CCAAGTACATCCCAGCTTT 


1395 


HNF4A 


NM 000457 


21361184 


D-003406-04 


GGACATGGCCGACTACAGT 


1396 














HNF4G 


NM_004133 


6631087 


D-003407-01 


GCACTGACATAAACGTTAA 


1397 


HNF4G 


NM 004133 


6631087 


D-003407-02 


ACAAAGAGATCCATGATGT 


1398 


HNF4G 


NM 004133 


6631087 


D-003407-03 


AGAGATCCATGATGTATAA 


1399 


HNF4G 


NM 004133 


6631087 


D-003407-04 


AAATGAACGTGACAGAATA 


1400 














HSAJ2425 


NM 017532 


8923776 


D-003408-01 


GAATGAATCTACACCTTTG 


1401 


HSAJ2425 


NM 017532 


8923776 


D-003408-02 


GGAAATACGTGGAGACACT 


1402 


HSAJ2425 


NM 017532 


8923776 


D-003408-03 


CCAGATAACTACGGCGATA 


1403 


HSAJ2425 


NM 017532 


8923776 


D-003408-04 


TGGCGTACCTTCTCATTGA 


1404 














NR0B1 


NM 000475 


5016089 


D-003409-01 


CAGCATGGATGATATGATG 


1405 


NR0B1 


NM 000475 


5016089 


D-003409-02 


CTGCTGAGATTCATCAATG 


1406 


NR0B1 


NM 000475 


5016089 


D-003409-03 


ACAGATTCATCGAACTTAA 


1407 


NR0B1 


NM 000475 


5016089 


D-003409-04 


GAACGTGGCGCTCCTGTAC 


1408 














NR0B2 


NM 021969 


13259502 


D-003410-01 


G AATATG CCTGCCTGAAAG 


1409 


NR0B2 


NM 021969 


13259502 


D-003410-02 


GGAATATGCCTGGGTGAAA 


1410 
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Kl DADO 




1 ozoyou^ 


r\ nnO>l 'I A AO 


rr* X A o r^Tr^ r^x A T/^T A 
Uo 1 AoULroLr 1 ijOU 1 A 1 VP 1 A 


14 1 1 




NM Oziyby 


1 o^:oyoO^: 


PI AAOyl4A f\A 


oOUA 1 1 1 Lr 1 AUoOAv 1 1 0 


14 1^ 














NmDI 


NM_021 724 


1 3430847 


V\ r\f\^ A A A £\A 

D-003411-U1 


CAACACAGGTGGCGTCATOllT 

xn 
Tj] 


1413 


|NJK 1 ul 


lNlVI__Uzl ( ii4 


A QylQAQylT 

1 o4oUo4 f 


r\ AAO/H 4 AO 


r^r^f^ Axr*oxr*xr Ar^xr^xr^xA rrxx 


i414 






1 o4oUt54 / 


U-UUo4 1 1 -Uo 


A A A xri r* Axxr^r"/^ A A A n rrx 

U AAUA 1 v?UA 1 1 UUvjAvjAAvjII i 

Til 
UJ 


1 A1 


IMiA 1 U 1 


MM 091794. 




n-on^^Ai 1-OA 


f^pf^pi 1 1 rinTrf^TTfiTTrrnT 

0\^V7 w 1 1 1 V3v/ 1 1 V^V3 1 1 V? 1 1 v^LL 1 1 

n 


141fi 
















NM 007191 


1132169Q 


n-00341 9-01 




1417 

1 "T 1 1 




NM 007191 

INIVI V/Vl 1 1^1 


1 139169Q 

1 1 \J£^ 1 w^«7 


n-00341 9-09 


GAAGAACAGATCCGGAAGA 


1418 




NM 007191 

1 ifllVi V/Vi f 1 ^ 1 


1 1391fi9Q 


n-00341 9-03 


PTAAGnAAGTGCnTGGTTT 


1419 

1 *T 1 i7 


NR1H2 


NM 007191 

l^flVI W f 1 ^ 1 


11*^91890 


n-00341 9-04 


GnXAACAGCGGCTCAAGAA 


1420 














NRIH"^ 

l^lxll l\J 


NM OO'SRQ'^ 


S0318Q9 


n-00341 3-01 


GAAnAGATCnnCCTGAAGA 


1421 

1 1 


NR1H3 


NM 00SBQ3 


5031 8Q2 


□-003413-09 


GGAGATAGTTGACTTTGCT 


1422 


NR1H3 


NM 005693 

INIVI \J\J\J\J^\J 


5031892 


n-00341 3-03 


GAGTTTGCCTTGCTCATTG 


1423 


NR1H3 


NM 0056Q3 

INIVI Vy*Jv/v»'^v-* 


5031 892 


D-0034 13-04 


TGACTTTGCTAAACAGCTA 


1424 














NR1 H4 


NM 005123 


4826979 


n-00341 4-01 


CAAGTGACCTCGACAACAA 


1425 


NR1 H4 

1^1 \ 1 1 l"T 


NM 005123 


4826979 


n-00341 4-02 


GAAAGAATTCGAAATAGTG 


1426 


NR1H4 


NM 005123 


4826979 


D-003414-03 


CAACAGACTCTTCTACATT 


1427 


NR1H4 


NM 005123 

INIVI 1 Ai.Si,' 


4826979 


n-00341 4-04 


GAACCATACTCGCAATACA 


1428 














NR1I2 


NM 003889 


11863133 


D-003415-01 


GAACCATGCTGACTTTGTA 


1429 


NR1I2 


NM 003889 


11863133 


D-003415-02 


GATGGACGCTCAGATGAAA 


1430 


NR1I2 


NM 003889 


11863133 


D-003415-03 


CAACCTACATGTTCAAAGG 


1431 


NR1i2 


NM 003889 


11863133 


D-003415-04 


CAGGAGCAATTGGCCATTA 


1432 














NR1I3 


NM 005122 


4826660 


D-003416-01 


GGAAATCTGTCACATGGTA 


1433 


NR1I3 


NM 005122 


4826660 


D-003416-02 


TCGCAGACATCAACACTTT 


1434 


NR1I3 


NM 005122 


4826660 


D-003416-03 


CCTCTTCGCTACACAATTG 


1435 


NR1I3 


NM 005122 


4826660 


D-003416-04 


GAACAGTTTGTGCAGTTTA 


1436 














NR2C1 


NM 003297 


4507672 


D-0034 17-01 


TGACAGCACTTGATCATAA 


1437 


NR2C1 


NM 003297 


4507672 


D-003417-02 


GGAAGGAAGTGTAGACCTA 


1438 


NR2C1 


NM_003297 


4507672 


D-003417-03 


GAGCACATCTTCAAACTAC 


1439 


NR2C1 


NM 003297 


4507672 


D-003417-04 


GAAGAAATTGCACATCAAA 


1440 














NR2C2 


NM 003298 


4507674 


D-003418-01 


GAACAACGGTGACACTTCA 


1441 


NR2C2 


NM 003298 


4507674 


0,-003418-02 


CTGATGAGCTCCAACATAA 


1442 


NR2C2 


NM 003298 


4507674 


D-003418-03 


CAACCTAAGTGAATCTTTG 


1443 


NR2C2 


NM 003298 


4507674 


D-003418-04 


GAAGACACCTACCGATTGG 


1444 














NR2E1 


NM 003269 


21361108 


D-003419-01 


GATCATATCTGAAATAGAG 


1445 


NR2E1 


NM 003269 


21361108 


D-003419-02 


CAAGACTGCTTTCAGATAT 


1446 
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NR2E1 


NM 003269 


21361108 


D-003419-03 


GTTAGATGCTACTGAATTT 


1/1 47 

1 *T*tf 


NR2E1 


NM 003269 


21361108 


n-00'^41Q-04 


CAATCTATCTPTATRAAnT 
wAArt 1 vj ir^iwiv^i/Ai i 


144ft 














NR2E3 




76S7'^Q4 


n-00'^490-01 


riAfiAArinTnmTTRTriATA 

\Dr\\Dr\r\\^ \u 1 wV-/ 1 I • V7 1 OAA 1 r\ 


144Q 


NR2E3 


NM 01494Q 


7fiS7'^Q4 


n-00'^490-09 


nAAf^PAPTATnr^PATPTAT 

O/AAAOvv/Av-^ 1 /A 1 O O w/A i V> i /A 1 


14*^0 




NM 01494Q 

I^IVI \J 1 •r^'Ti/ 


7fiS7'HQ4 


n-00'^490-0'^ 


A A f5 ATPPT^i A P A P T A 

VJ/A/AOO/A 1 V-/\-/ 1 OMOV^/AV./0 1 M 


14m 








H-On '^490-04 


vJ/A/AVJO 1 1 1 1 O 1 OM 1 M 1 O 


1*fO^ 














NR9F1 


NM OO^'^ 


901 974A4 


1^-00*^491 01 


(^AA APTPTP ATPP/^PrS ATA 






NM C\ORf{RA 


901 974ft4 


0-007491-09 


TPTP ATPPr^PI^ AT ATr^TTA 


l404 






90197/lft/l 


0-00*^491 0*^ 


P A Ar: A Ar^T/^PPTP A A A^iTri 


I HDD 


NR9F1 


KIM nn^R^A 


90197Aftil 


0-00*^491-04 


(^ri A APTT A APTT* A P AP AT*^ 


140D 
















NM n9inn^ 


1414Q7A*^ 


n 00*^499 01 


f^TAPPTr^TPPr^l^ AT AT ATT 


•1 yiiz7 




MM (\o^^^f\ 


1 A1 AQ7A<% 


n 00*^499 09 


PP A APP A riPPri AP/^ A A TT 


14oc> 




MM n9inn(^ 




n 00*^499 O"^ 


APTPf^T APPT/^TPPPil^ ATA 


j4oy 




MM 091 nnc; 


1 41 AQ7ilR 


n 00'1499_04 


rSl^PPr^T ATAT^^/^P A ATTP A 


i4dU 
















NM nn'^9'^4 


90O7O1 Qft 


0-00*^49*^ 01 


Apr^ ppTr^Tr^nppTPTp a 


i40 1 




NM 00*^9 ^^4 

INIVI Wv/iJZ.O'T 


9007O1 Qft 
illiU r vi 1 y o 


n 00*^49*^ 09 


p A riPPr^fiiTfiiTPprs a a PTr:: a 


14ftO 
i4D^ 


NR^^Ffi 


MM 00^9*^4 

INIVI \J\J\J/iOH 


900701QR 

iCuu f u lyo 


0-00*^4 9'^-0 


PAAPPr^Tfi APTriPPA/^ATP 




NIR2F6 


NM 00S9'^4 


900701 QR 


0-00*^4 9^^-04 


riT A PTH P P HTPTP A A A A 
V? I AL/ 1 OV^Ov? 1 1 V^AAoAAv:? 


14R4 
















NM nnni7fi 

INIVI UuLr 1 / V? 


4*^041 '^9 


n-OO'^494-01 


r^A<^r^APAnATr:TAPPAPTA 
V3AOOAOAV3 A 1 \J 1 AVyvvAv-/ 1 A 




NR3C1 


NM 000176 


4*^041 '^9 


0-00*^494-09 


AT A Ar^ APP AT^i AfiT ATTfi 

VJrA 1 /A/Avj/AV^Vy/A 1 OAO 1 A 1 1 O 




NR3C1 


NM 000176 


45041 "^9 


n-00'^494-0'? 


RAAfiAPRATTPATTPPTTT 

VJ/A/AVJrAwVJ/A 1 1 w/A 1 1 III 




NR3C1 


NM 000176 


45041 ^^2 


n-00'^494-04 


Rf^APARATfiTAPPAPTATf^ 
















NR3C2 


NM 000901 


4505198 


D-003425-01 


GCAAACAGATGATCCAAGT 




NR3C2 


NM 000901 


4505198 


0-003425-02 


CAGCTAAGATTTATCAGAA 


1470 


NR3C2 


NM 000901 


4505198 


D-003425-03 


GCACGAAAGTCAAAGAAGT 


1471 

1 *t/ I 


NR3C2 


NM 000901 


4505198 


D-003425-04 


GGTATCCGGTCTTAGAATA 


1479 














NR4A1 


NM 002135 


21361341 


D-003426-01 


GAAGGAAGTTGTCCGAACA 


1473 


NR4A1 


NM 002135 


21361341 


D-003426-02 


CAGGAGAGTTTGACACCTT 


1474 


NR4A1 


NM 002135 


21361341 


D-003426-03 


CAGTGGCTCTGACTACTAT 


1475 


NR4A1 


NM 002135 


21361341 


D-003426-04 


GAAGGCCGCTGTGCTGTGT 


1476 














NR4A2 


NM 006186 


5453821 


D-003427-01 


GCAATGCGTTCGTGGCTTT 


1477 


NR4A2 


NM 006186 


5453821 


D-003427-02 


CGGCTACACAGGAGAGTTT 


1478 


NR4A2 


NM 006186 


5453821 


D-003427-03 


GCACGTGACTTTCAACAAT 


1479 


NR4A2 


NM 006186 


5453821 


D-003427-04 


GAATACAGCTCCGATTTCT 


1480 














NR4A3 


NM 006981 


11276070 


D-003428-01 


CAAAGAAGATCAGACATTA 


1481 


NR4A3 


NM 006981 


11276070 


D-003428-02 


GATCAGACATTACTTA7TG 


1482 


NR4A3 


NM 006981 


11276070 


D-003428-03 


CCAGAGATCTTGATTATTC 


1483 


NR4A3 


NM 006981 


11276070 


D-003428-04 


GAAGTTGTCCGTACAGATA 


1484 














NR5A1 


NM 004959 


20070192 


D-003429-01 


GATTTGAAGTTCCTGAATA 


1485 
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NR5A1 


NM 004959 


20070192 


D-003429-02 


GGAGCGAGCTGCTGGTGTT 


1486 


NR5A1 


NM 004959 


20070192 


D-003429-03 


GGAGGTGGCCGACCAGATG 


1487 


NR5A1 


NM 004959 


20070192 


D-003429-04 


CAACGTGCCTGAGCTCATC 


1488 














NR5A2 

1^1 XK^« A^B^ 


NM 003822 


20070161 


D-003430-01 


CCAAACATATGGCCACTTT 


1489 


NR5A2 


NM 003822 


20070161 


D-003430-02 


TCAGAGAACTTAAGGTTGA 


1490 


NR5A2 


NM 003822 


20070161 


D-003430-03 


GGATCCATCTTCCTGGTTA 


1491 


NR5A2 


NM 003822 


20070161 


D-003430-04 


AAGAATACCTCTACTACAA 


1492 
















NM 033334 


15451847 


D-003431-01 


C AACG AACC 1 G 1 C 1 C A 1 I I 


1493 


NR6A1 


NM 033334 


15451847 


D-003431-02 


GAAGAACTACACAGATTTA 


1494 


NR6A1 


NM 033334 


15451847 


D-003431-03 


GAAGATGGATACGCTGTGA 


1495 


NR6A1 


NM 033334 


15451847 


0-003431-04 


AAACGATACTGGTACATTT 


1496 














null 


□16815 


2116671 


D-003432-01 


GAAGAATGATCGAATAGAT 


1497 


null 


D1681S 


2116671 


D-003432-02 


GAACATGGAGCAATATAAT 


1498 


null 


D1681S 


2116671 


D-003432-03 


GAGGAGC rC r 1 GGCCI 1 lA 


1499 


null 


016815 


2116671 


D-003432-04 


TAAACAACATGCACTCTGA 


1500 














PGR 


NM 000926 


4505766 


D-003433-01 


GAGATGAGGTCAAGCTACA 


1501 


PGR 


NM 000926 


4505766 


D-003433-02 


CAGCGTTTCTATCAACTTA 

1 X^ X^ III 1 1 U IX.^ i 1 # \ 


1502 


PGR 


NM 000926 


4505766 


D-003433-03 


AGATAACTCTCATTCAGTA 


1503 


PGR 


NM 000926 


4505766 


D-003433-04 


GTAGTCAAGTGGTCTAAAT 


1504 














PPARA 


NM 005036 


7549810 


D-003434-01 


TCACGGAGCTCACGGAATT 


1505 


PPARA 


NM 005036 


7549810 


D-003434-02 


GAACATGACATAGAAGATT 


1506 


PPARA 


NM 005036 


7549810 


D-003434-03 


GGATAGTTCTGGAAGCTTT 


1507 


PPARA 


NM 005036 


7549810 


D-003434-04 


GACTCAAGCTGGTGTATGA 


1508 














PPARD 


NM 006238 


5453939 


D-003435-01 


GAGCGCAGCTGCAAGATTC 


1509 


PPARD 


NM 006238 


5453939 


D-003435-02 


GCATGAAGCTGGAGTACGA 


1510 


PPARD 


NM 006238 


5453939 


D-003435-03 


GGAAGCAGTTGGTGAATGG 


1511 


PPARD 


NM 006238 


5453939 


D-003435-04 


GCTGCAAGATTCAGAAGAA 


1512 














PPARG 


NM 138712 


20336234 


D-003436-01 


AGACTCAGCTCTACAATAA 


1513 


PPARG 


NM 138712 


20336234 


D-003436-02 


GATTGAAGCTTATCTATGA 


1514 


PPARG 


NM 138712 


20336234 


D-003436-03 


AAGTAACTCTCCTCAAATA 


1515 


PPARG 


NM 138712 


20336234 


D-003436-04 


GCATTTCTACTCCACATTA 


1516 














RARA 


NM 000964 


4506418 


D-003437'01 


GACAAGAACTGCATCATCA 


1517 


RARA 


NM 000964 


4506418 


D-003437-02 


GCAAATAGACTACGAACAA 


1518 


RARA 


NM 000964 


4506418 


D-003437-03 


GAACAACAGCTCAGAACAA 


1519 


RARA 


NM 000964 


4506418 


D-003437-04 


GAGCAGCAGTTCTGAAGAG 


1520 














RARB 


NM 000965 


14916493 


D-003438-01 


GCACACTGCTCAATCAATT 


1521 


RARB 


NM 000965 


14916493 


D-003438-02 


GCAGAAGTATTCAGAAGAA 


1522 


RARB 


NM 000965 


14916493 


D-003438-03 


GGAATGACAGGAACAAGAA 


1523 


RARB 


NM 000965 


14916493 


D-003438-O4 


GCACAGTCCTAGCATCTCA 


1524 
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RARG 


NM 000966 


21359851 


D-003439-01 


GAAATGACCGGAACAAGAA 


1525 


RARG 


NM 000966 


21359851 


D-003439-02 


TAGAAGAGCTCATCACCAA 


1526 


RARG 


NM 000966 


21359851 


D-003439-03 


CAAGGAAGCTGTGCGAAAT 


1527 


RARG 


NM 000966 


21359851 


D-003439-04 


TCAGTGAGCTGGCTACCAA 


1528 














RORA 


NM 134261 


19743902 


D-003440-01 


GGAAAGAGTTTATGTTCTA 


1529 


RORA 


NM 134261 


19743902 


D-003440-02 


CAAGATCTGTGGAGACAAA 


1530 


RORA 


NM 134261 


19743902 


D-003440-03 


GCACCTGACTGAAGATGAA 


1531 


RORA 


NM 134261 


19743902 


D-003440-04 


CCGAGAAGATGGAATACTA 


1532 














RORB 


NM 006914 


19743906 


D-003441-01 


GCACAGAACATCATTAAGT 


1533 


RORB 


NM 006914 


19743906 


D-003441-02 


CCACACCTATGAAGAAATT 


1534 


RORB 


NM 006914 


19743906 


D-003441-03 


GATCAAATTCTACTTCTGA 


1535 


RORB 


NM 006914 


19743906 


D-003441-04 


TCAAACAGATAAAGCAAGA 


1536 














RORC 


NM 005060 


19743908 


D-003442-01 


TAGAACAGCTGCAGTACAA 


1537 


RORC 


NM 005060 


19743908 


D-003442-02 


TCACCGAGGCCATTCAGTA 


1538 


RORC 


NM 005060 


19743908 


D-003442-03 


GAACAGCTGCAGTACAATC 


1539 


RORC 


NM 005060 


19743908 


D-003442-04 


CCTCATGCCACCTTGAATA 


1540 














RXRA 


NM 002957 


21536318 


D-003443-01 


TGACGGAGCTTGTGTCCAA 


1541 


RXRA 


NM 002957 


21536318 


D-003443-02 


CAACAAGGACTGCCTGATT 


1542 


RXRA 


NM 002957 


21536318 


D-003443-03 


GCAAGGACCTGACCTACAC 


1543 


RXRA 


NM 002957 


21536318 


D-003443-04 


GCAAGGACCGGAACGAGAA 


1544 














RXRB 


NM 021976 


21687229 


D-003444-01 


GCAAAGACCTTACATACTC 


1545 


RXRB 


NM 021976 


21687229 


D-003444-02 


GCAATCATTCTGTTTAATC 


1546 


RXRB 


NM 021976 


21687229 


D-003444-03 


TCACACCGATCCATTGATG 


1547 


RXRB 


NM 021976 


21687229 


D-003444-04 


GCAAACGGCTATGTGCAAT 


1548 














RXRG 


NM 006917 


21361386 


D-003445-01 


GGAAGGACCTCATCTACAC 


1549 


RXRG 


NM 006917 


21361386 


D-003445-02 


CCGGATCTCTGGTTAAACA 


1550 


RXRG 


NM 006917 


21361386 


D-003445-03 


GCGAGCCATTGTACTCTTT 


1551 


RXRG 


NM 006917 


21361386 


D-003445-04 


GAGCCATTGTACTCTTTAA 


1552 














THRA 


NM 003250 


20127451 


D-003446-01 


GGACAAAGACGAGCAGTGT 


1553 


THRA 


NM 003250 


20127451 


D-003446-02 


GGAAACAGAGGCGGAAATT 


1554 


THRA 


NM 003250 


20127451 


D-003446-03 


GTAAGCTGATTGAGCAGAA 


1555 


THRA 


NM 003250 


20127451 


D-003446-04 


GAACCTCGATCCCACCTAT 


1556 














THRB 


NM 000461 


10835122 


D-003447-01 


GAATGTCGCTTTAAGAAAT 


1557 


THRB 


NM 000461 


10835122 


D-003447-02 


GAACAGTCGTCGCCACATC 


1558 


THRB 


NM 000461 


10835122 


D-003447-03 


GGACAAGCACCAATAGTCA 


1559 


THRB 


NM 000461 


10835122 


D-003447-04 


GTGGAAAGGTTGACTTGGA 


1560 














VDR 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


VDR 


NM 000376 


4507882 


D-003448-02 


GCAACCAAGACTACAAGTA 


1562 


VDR 


NM 000376 


4507882 


D-003448-03 


TCAATGCTATGACCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003446-04 ICCATTGAGGTCATCATGTT 


1564 



Applicants: Khvorova a/. 
Serial No.: 10/714,333 
Filing Date: November 14, 2003 
Preliminary Amendment 
May 20, 2004 
Page 20 of 42 



Please replace Table X, spanning pages 129-146, with the following amended Table X: 



Table X 



TABLE X 


Gene 






oyrnuoi 






MDVyD 1 


GACCAUAAAUGUAAGGUUU 


1565 




UAGAAGAUCUGAUGUCAAA 


1566 




GAAAUGUUCACUUGAGUUA 


1567 




GAAGAUCGCUACUGAAGCA 


1568 








ABCC1 








GGAAGCAACUGCAGAGACA 


1569 




GAUGACACCUCUCAACAAA 


1570 




UAAAGUUGCUCAUCAAGUU 


1571 




CAACGAGUCUGCCGAAGGA 


1572 








ABCG2 








GCAGAUGCCUUCUUCGUUA 


1573 




AGGCAAAUCUUCGUUAUUA 


1574 




GGGAAGAAAUCUGGUCUAA 


1575 




UGACUCAUCCCAACAUUUA 


1576 








KCNH2 








CCGACGUGCUGCCUGAGUA 


1577 




GAGAAGAGCAGCGACACUU 


1578 




GAUCAUAGCACCUAAGAUA 


1579 




GCUAUUUACUGCUCUUAUU 


1580 




UCACUGGGCUCCUUUAAUU 


1581 




GUGCGAGCCUUCUGAAUAU 


1582 




GCUAAGCUAUACUACUGUA , 


1583 




UGACGGCGCUCUACUUCAC 


1584 








KCNH1 


GAGAU^MUUCCUUUGi^F 








1585 




GAAGAACGCAUGAAACGAA 


1586 




GAUAAAGACACGAUUGAAA 


. 1587 




GCUGAGAGGUCUAUUUAAA 


1588 








CLCA1 








GAACAACAAUGGCUAUGAA 


1589 




GUACAUACCUGGCUGGAUU 


1590 
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GAACAGCUCACAAGUAUAU 


1591 


GGAAACGUGUGUCUAUAUU 


1592 








GGAGGUGGGAGGACAGUUA 


1593 


UCACAGCCCUGGUGGAUGA 


1594 


GAAGCUGGCUCCUAUGUUC 


1595 


GGUCAACACUACCAACAUG 


1596 



SLC6A2 



SLC21A2 



SLC21A3 







GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGCUGGCCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


CAACAAAUUUGACAACAAC 


1600 








GUACAUCUCCAUCUUAUUU 


1601 


GGAAGUGGCUGAGUUAUUA 


1602 


GAAGGGA6GCUCAAUGUAA 


1603 


GAAGGAAGUGGCUGAGUUA 


1604 








GUAGAAACAGGAGCUAUUA 


1605 


CAAGAUUACUGUCAAACAA 


1606 


1 GCACAAGAGU AUUUGGUAA 


1607 


! GCAAAUGUCCCUUCUGUAU 


1608 


; GCAUGACUCCUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA 


1610 


GAAAAUGCCUCUUCAGGAA 


1611 



SLC28A1 



SLC29A1 



SLC26A1 









GUUCAUCGCUCUCCUCUUU 


1612 


GGAUCAAGCUGUUUCUGAA 


1613 


GGACUGCAGUUUGUACUUG 


1614 . 


GAGUGAAACUGACCUAUGG 


1615 










GAACGCUGCUCCCGUGGAA 


1616 


GAAAGCCACUCUAUCAAAG 


1617 


GAAACCAGGUGCCUUCAGA 


' 1618 


CCUCACAGCUGUAUUCAUG 


L 1619 










CCACGGAGCUGCUGGUCAU 


1620 


■GGiSUUGACAUCyUAUUUGA 


1621 


GCACGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUCAUUG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 
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SLC26A2 



! GAGCAAGCGUCCUCCAAAU 


1626 


j GCAACACCCAUGGCAAUUA 


1627 




CCAAAGAACUCAAUGAACA ^^^^ 






1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 


GUAUUGAACUGUACUGUAA 


1631 



SLC4A4 







GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


1633 


GGACAAAGCCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU i 


1635 



GLRA1 



1 UGAAAGCCAUUG ACAUU UG 






1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGAGUA 


f 1639 



KLK1 



ADAM2 



XPNPEPl 



GZMA 



CMKLR1 







UCAGAGUGCUGUCUUAUGU 


i 1640 


CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGGUGAUAC 


1642 


AGGGGGCUCUGUACCAUUU 


1643 








GAAACAUGCUGUGAUAUUG 


1 1644 


GCAGAUGUUUCCUUAUAUA | 1645 


CAACAGAGAUGCCAUGAUA j 1646 


GAAAGGGGCUACAUUGAGA 


1647 








GACCUGAGCUUCCCAACAA 


1648 


GCGACUGGCUCAACAAUUA 


1649 


GAGAUUGCGUGGCUAUUUA 


1650 


GACAGCAACUGGACACUUA 


1651 








GdAAGAGACUCGUGCAAUG 


1652 


GGAACCAUGUGCCAAGUUG 


1653 


GAAGUAACUCCUCAUUCAA 


1654 


GAACUCCUAUAGAUUUCUG 


1655 








CAUAGAAGCUUUACCAAGA 


1656 


GAAUGGAGGAUGAAGAUUA 


1657 
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CLN3 



1 GGUCAAUGCUCUAAGUGAA 


1658 


1 GAGAGGACUUCUAUGAAUG 


1659 








CAUCAUGCCUUCUGAAUAA 


1660 


CAACAGCUCAUCACGAUUU 


1661 


GCAACAACUUCUCUUAUGU 


1662 


GGUCUUCGCUAGCAUCUCA 


1663 



CALCR 







GGACCUAGCUGUUGUAAAG 


1664 


GAAAGACCAUGCAUUUAAA 


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


GAAUAAACCAGUAUCGUUA 


1667 



OXTR 



^^^CAGAUAUCCAAAUA 






1668 


GCAAUACUAUCCUAACUGA 


1669 


GAAUAUAGAUUAGCGUUUG 


1670 


GAUGAGGCAUGACUACUAA 


1671 








GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGCCACCCACUGA 


1673 


GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC 


1675 



EDG5 



EDG7 







UCCAGGAACACUAUAAUUA 


1676 


GUGACCAUCUUCUCCAUCA 


1677 


CAUCCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 








ACACUGAUACUGUCGAUGA 


1680 


! AAUAGGAGCAACACUG AUA 


1681 


CAGCAGGAGUUACCUUGUU 


1682 


GGACACCCAUGAAGCUAAU 


1683 








GCACAGAACUCCACUCAAA 


1684 


IGGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 


GGACAAACUUCGACCCUUU 


1687 










UCGCUACCCUGCUGUUAUU 


1688 


GCUACAAGAACUACCGAUA 


1689 


CAAGAAAGCUUCCUUCAAC 


1690 
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CASP3 



CASP6 



CASP7 



CASP8 



CASP9 



DVL1 



GAGAAGAAAUACAGUCAAU 


1691 






^^^^^^^^ 


CAAUAUAUCUGAAGAGCUA 


1692 


GAACUGGACUGUGGCAUUG 


1693 


GUGAGAAGAUGGUAUAUUU 


1694 


1 GAGGGUAGUUUAAGACAUA 


1695 








CAUGAGGUGUCAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG 


1697 


AAAUAUGGCUCCUCGUUAG 


1698 


GCAAUCACAUUUAUGCAUA 


1699 


CAACAUAACUGAGGUGGAU 


1700 


CAUGGUACAUUCAAGAUUU 


1701 






GAACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


CAACAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUGACUGA 


1705 










GAAGUGAACUAUGAAGUAA 


1706 


CAACAAGGAUGACAAGAAA 


1707 


GGACAAAGUUUACCAAAUG 


1708 


GAGGGUCGAUCAUCUAUUA 


1709 


GAAUAUAGAGGGCUUAUGA 


1710 


CAACGACUAUGAAGAAUUC 


1711 


GAAGUGAGCAGAUCAGAAU 


1712 


GAGGAAAUCUCCAAAUGCA 


1713 










CCAGGCAGCUGAUCAUAGA 


1714 


UCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


1716 


CCACUGGUCUGUAGGGAUU | 


1717 






UCGUAAAGCUGUUGAUAUC 


1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUCGA 


1720 


GAUCGUAAAGCUGUUGAUA 


1721 




agacgaaggugauuuacca" ^ 






1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA 


1724 


UAAUAGGCAUUUCCUCUUU 


1725 . 
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PTEN 



PDK1 



PDK2 



PPP2CA 



CTNNA1 



CTNNA2 



HSPCA 



DCTN2 









GUGAAGAUCUUGACCAAUG 


1726 


GAUGAGCAUACACAAAUUA 


1727 


GAAUGAACCUUGUGCAACA 


1728 


ggcgcuauguguauuauua' 


" 1729 




GUAcSAAGCUGGUAUAubc 






1730 


GAAAGACUCCCAGUGUAUA 


1731 


GGAAGUCCAUCUGAUCGAA 


1732 


'ccaaagacaugacgacguu 


1733 












GUAAAGAGGAGACUGAAUG 


1734 


GGUGUGUGAUGGUCCCUAA 


1735 


1 CAAAGAUGCOUACGACAUG 


1736 


GGGCGAUGCCUGAGGGUUA 


1737' ■ ~ 




UCAcicAAGUUUAUGGUUU 






1738 


CAACAGCCGUGACCACUUU 


'1739 


UAACCAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA 


^ 1741 








GAAGAGAGGUCGUUCUAAG 


1742 


AAGCAGAUGUGCAUGAUUA 


1743 


UGUAAUAACUGCAGUGUUU 


1744 


GUAMGGGCCCUCUAAUAA 


1745 








GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCACAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCOCAUGUACUA 


1749 








I^GAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUAGUGA 


1751 


UAUAAGAGCUUGACCAAUG 


' 1752 


GCAGAUAUCUCUAUGAUUG 


1753 






C/b(cUCAu5cCAAUASG' 


1754 


GGAAUGAGCCAGAUGUUUA 


1755 


GGAGACAGCUGUACGUUGU 


1756 


UCGAAGAGCUGACAACUGA 


i 1757 





Applicants: Khvorova a/. 
Serial No.: 10/714,333 
Filing Date: November 14, 2003 
Preliminary Amendment 
May 20, 2004 
Page 26 of 42 



CD2 



BAD 



SMAC 



MAP3K5 



PVR 



ERBB2 



S0S1 



BRCA1 







GUAAGGAGAAGCAAUAUAA 


f 1758 


AAGAUGAGCUUUCCAUGUA 


'~ 1759 


GGACAUCUAUCUCAUCAUU 


1760 


GACAAGAGCCCACAGAGUA 


1761 








GUACUUCCCUCAGGCCUAU 


1762 


GCUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC 


1765 








CAGCGUAACUUCAUUCUUC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA 


1769 


UAGAAGAGCUCCGUCAGAA 


^ 1770 


CCACAUAUGCGUUGAUUGA 


1771 


GCGCAGGGCUCUCUACCUA 


1772 








1 GAACAGCCUUCAAAUCAAA 


1773 


GAUGU UCUCUACU AUG UU A 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


' CAGGAAAGCUCGUAAUUUA 


1776 



CCACACGGCUGACCUCAUA 



CAGCAGAAUUCCUCUUAUA 



GCAGAAUUCCUCUUAUAAA 



GAUCGGGAUUUAUUUCUAU 



1777 



1778 



1779 



1780 



UGUGGGAGCUGAUGACUUU 



1781 



UCACAGAGAUCUUGAAAGG 



1782 



UGGAAGAGAUCACAGGUUA 



1783 



GCUCAUCGCUCACAACCAA 



! 

.j 



1784 







GAGCAicACUUCUAUGAUU ] 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


j UGAAAGCCCUCCCUUAUUA 


1787 


[gaaauagcauggagaagga 


1788 








CCAUACAGCUUCAUAAAUA 


1789 


[GAAGAGAACUUAUCUAGUG 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 
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CDKN1A 



CDKN1B 



SLC2A4 



N0S2A 



FRAP1 



FKBP1A 



TNFRSF1A 



GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


"gaacggacacugaaauauu 


1794 


GCAGAUAGUUCUACCAGUA 


1795 










GAACAAGG AG UCAGAC AUU 


1796 


aaacuaggcgguugaauga 


1797 


i GAUGGAACUUCGACUUUGU 


1798 


guaaacagauggcacuuug 


1799 










ggaauggacauccuguaua 


1800 


gg agaaagaug ucaaacgu 


1801 


GAAUGGACAUCCUGUAUAA 


1802 


GUAAACAGCUCGAAUUAAG 


1803 








CAGAUAGGCUCCGAAGAUG 


1804 


AGACUCAGCUCCAGAAUAC 


1805 


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC 


1807 






CCAGAUAAGUGACAUAAGU 


1808 


UAAGUGACCUGCUUUGUAA 


1809 


GAAGAGAGAUUCCAUUGAA 


1810 


UGAAAGAGCUCAACAACAA 


1811 










GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 1 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


1 UAAGAAAGCUAUCCAGAUU 


1815 








IGAAACMGCCCUUUAAGUU 


1816 


GAAUUACUCUCCAAGUUGA 


1817 


fCAGCACAAGUGGUAGGUUA 


1818 


GUUGAGGACUGAAUUACUC 


1819 


Igauggcagcuguuuaaaug 


1820 


rGAGUAUCCUUUCAGUGUUA 


1821 










CAAAGGAACCUACUUGUAC 


1822 


GGAACCUACUUGUACAAUG 


1823 


GAACCUACUUGUACAAUGA 


1824 


GAGUGUGUCUCCUGUAGUA 


1825 
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IL1R1 



IRAKI 



TRAF2 



TRAF6 



TRADD 



FADD 



IKBKE 







GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


1829 








CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCCCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 








GAAGACAGAGUUAUUAAAC 


1834 


UCACGAAGACAGAGUUAUU 


1835 


Agacagaguuauuaaacca 


1836 


CACGAAGACAGAGUUAUUA 


\ 1837 


1 GCUGAAGCCUGUCUGAUGU 


1838 






CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 


GGAGAAACCU6UUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1 1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 




l^^fecACCUUGAUCuOuG 






1847 


' GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU 


1852 


GCUCACUCCUUUCUACUAA 


1853 
1854" 


UGAAGCACCUUGAUCUUUG 








GCACAGAUAUUUCCAUUUC 


1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 






UAAGAACACUGCUCAUGAA 


1859 


GAGGCAUCCUGAAGCAUUA 


1860 


GAAGGCGGCUGCAGAACUG 


1861 
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IKBKG 



TNFRSF6 



RELA 



ARHA 



CDC42 



R0CK1 



PAK1 



1 GGAACAAGGAGAUCAUGUA 


1862 








CUAUCGAGGUCGUUAAAUU 


1863 


GAAUGCAGCUGGAAGAUCU 


1864 


GCGGCGAGCUGGACUGUUU 


1865 


CCAGACCGAUGUGUAUUUA 


1866 








GGUCUCACCUCGCUAUGGU i 


1867 


GAAAGCGAAUUCCUAGACA 


1868 


GCACAAACAAGACUGAUGU 


1869 


GAAGGGCACCUCAGAAACA 


1870 


UCUCCCAACUUGUAUUAAA 


1871 







m 




■1 


UCAAGUGUCUUCCAUCAUG 


1872 


UCAAGUGCCUUAAUAGUAG 




1873 




GGAGUACCCUGAGGCUAUA 







1874 




GAUGAGAUCUUCCUACUGU 


1875 










GAGCUGGGCUAAGUAAAUA 


187(3 


GACCAAAGAUGGAGUGAGA 


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




GGACAUUUGUUUGCCAUUU ^^^^^^ 






^^^^ 




1880 


GGAGAACCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUAGAG 


' 1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 






1885 




i UGACAAACCUUAUGGAAAA 







1886 
















GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG 






1888 




GACAAGAGAUUACAGAUAA 






1889 




GAAGAAACAUUCCCUAUUC 


_ 


1890 






fcAGGGUGGuJuiuGAudAA 
















1891 




CAACAAAGAACAAUCACUA 


1892 


GAAGAAAUAUACACGGUUU 


1893 


UACAUGAGCUUUACAGAUA 


1894 



PAK2 
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PAK3 



PAK4 



PAK6 



PAK7 



HDAC1 



HDAC2 



CREBBP 



BTRC 



jGGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 


CUACAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACCGUUA 


1898 








' G AUUAUCGCUGCAAAGGAA 


1899 


GAGAGUGCCUGCAAGCUUU 


1900 


GACAAGAGGUGGCCAUAAA , 


1901 


UUAAAUCGCUGUCUUGAGA 


1902 








ACUAAGAGGUGAACAUGUA 


1903 


GAUCAUGAAUGUCCGAAGA 


1904 


GAUGAGACCCUACUACUGA 


1905 


! CAGCAAAGGUGCCAAAGAU 


1906 








UAAAGGCAGUUGUCCACUA \ 


1907 


GAAGGGACCUGCUUUCUUG \ 


1908 


GCAAAGACGUCCCUAAGAG 1 1909 


CCAAUGGGCUGGCUGCAAA t 1910 



GAGCACGGCUUUAAUAAGU 



1911 



CAAACUC CGUUAUGAUAUA 
j GGAUAAAGUUG UCUGAUUU 
iGGAAAUGCGUCCAUAAAUA 



1912 



1913 



1914 







GGACAUCGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


'ggacaaggccacccaauga 


1917 


CCACAGCGAUGACUACAUU 


1918 






^^^^ 


1 GCUGUUAAAUUAUGGCUUA 


1919 


GCAAAGAAAGCUAGA^ ] 


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


i CCAAUGAGUUGCCAUAUAA 


1922 






GGCCAUAGCU UAAUUAAUC 


1923 


gcacagccguuDaccauga 


1924 


ggacagcccuuuagucaag" 


1925 


gaacugauuccugaaauaa 1 


1926 








CACAUAAACUCGUAUCUUA 1927 | 
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RIPK2 



VAV1 



VAV2 



GRB2 



PLCG1 



ITGB1 



ITGA4 



STAT1 



GAGAAGGCACUCAAGUUUA 


1928 


AGACAUAGUUUACAGAGAA 


1929 


IGCAGAGAGAUUUCAUAACU 


1930 




^^^^^^^^^^^^^^^^^^ 




GAACAUACCUGUAAAUCAU 


1931 


GGACAUCGACCUGUUAUUA 


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAAGAUA 


1934 








GCAGAAAUACAUCUACUAA 


1935 


GCUAUGAGCUGUUCUUCAA 


1936 


CGACAAAGCUCUACUCAUC 


1937 


GCUCAACCCUGGAGACAUU 


1938 








GGACAAGACUCGCAGAUUU 


1939 


GCUGAGCGCUUUGGAAUAA 


1940 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 








UGGAAGCCAUCGCCAAAUA 


ff 194211 432 


CAUCAGUGCAUGACGUUUA 


1943 


j UGAAUGAGCUGGUGGAUUA 


1944 


i UGGCAAAACUUACCUAUAA 


1945 






&AGCUGCACUCCAAuiAGA""' ' "1 




GAAACCAAGCCAUUAAUGA 


1947 


CCAAGGAGCUACUGACAUU 


1948 


AGAGAAACAUGGCCCAAUA 


1949 








CCACAGACAUUUACAUUAA 1 


. 1950 


! GAAGGGAGUUUGCUAAAUU 


1951 


i GAACAGAUCUGAUGAAUGA 


1952 


i CAAGAGAGCUGAAGACUAU 


1953 








GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


i GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU 


1957 








AGAAAGAGCUUGACAGUAA 


1958 


GGAAGUAGUUCACAAAAUA 


1959 
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KRAS2 



HRAS 



BRAF 



ELK1 



RALGDS 



PRKCA 



MAP2K4 



MAP2K7 



UGAAGUAUCUGUAUCCAAA | "1960 


GAGCUUCACUCCCUUAGUU | 1961 






UAAGGACUCUGAAGAUGUA 


1962 


GACAAAGUGUGUAAUUAUG 


1963 


GCUCAGGACUUAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAUACCUAAGAU 


1966 


UAAGGACUCUGAAGAUGUA 


1967 


GACAAAGUGUGUAAUUAUG 


1968 


GCUCAGGACUUAGCAAGAA 


1969 








CCAUCCAGCUGAUCCAGAA 


•^19^^^ 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 






^^^^^^^^ 


GAUUAGAGACCAAGGAUUU 


2410 


CCACUGAUGUGUGUUAAUU 


1973 


CAAUAGAACCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1 1975 








GAUGUGAGUAGAAGAGUUA 


rri 97511 2411 


GGAAGAAUUUGUACCAUUU 


1976 


GAACGACCUUUCUUUCUUU 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 








GGAGAAGCCUCACCUCUUG 


1979 


GCAGAAAGGACUCAAGAUU 


1980 


GAGAACAACUACUCAUUGA 


1981 


GAACUUCUCGUCACUGUAU 


1982 








IGGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUCUCGUAUGUCA 


1984 


GAAGAAGGAUGUGGUGAUU 


1985 


GGACUGGGAUCGAACAACA 


1986 










GGACAGAAGUGGAAAUAUU 


1987 


1 UCAAAGAGGUGAACAUUAA 


1988 


GACCAAAUCUCAG UUGUUU 


1989 


GGAGAAUGGUGCUGUUUAA 


1990 
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MAPK8 



MAPK9 



AIF1 



BBC3 



BCL2L1 



BCL2L11 



GAAGAGACCAAAGUAUAAU 


1991 


GAAGACCGGCCACGUCAUU 


1992 


GGAAGAGACCAAAGUAUAA 


T 1993 


GCAUUGAGAUUGACCAGAA 


1994 


UGAGAGAACGAGAAAGUUG 


1995 


GUGAAACCCUGUCUGCAUU 


1996 


GGAUCUCUCUCAACAACUA 


1997 


ACAACUAGGUGAACACAUA 


1998 








UCACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC 


2000 


GCUCAUGGAUGCAAAUCUU 


2001 


GAAGGUAAGCCGACCAUUU 


^ 2002 








AAAGAGAGCUUAUCGUGAA 


2003 


GAUGAUAGGUUAGAAAUAG 


' 2004 


■ ACAAAGAAGUCAUGGAUUG 


2005 


GGAGCUGGAUCAUGAAAGA 


2006 








GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUCCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA 


2009 


UGAAACGAAUGCUGGAGAA i 2010 


UCACUCACCCAGAGAAAUA 


2011 


CCAAGAAAGCUAUCUCUGA 


2012 


AGACUCACCUAGAGCUAAA 


2013 








CCUGGAGGGUCCUGUACAA ' 


2014 


GAGCAAAUGAGGCAAACGU 


2015 


GGAGGGUCCUGUACAAUCU 


2016 


GACUUUCUCUGCACCAUGU 


2017 




CCAGGGAGCUUGAMGl^^^^^ 






2018 


AAAGUGCAGUUCAGUAAUA 


2019 


GAGAAUCACUAACCAGAGA ' 


2020 


GAGCGCAUCCCUAUUAUAA 


2021 




GAGACGAGUUUAACGCUUA ^ 






2022 


AAAGCAACCUUCUGAUGUA 


2023 


CCGAGAAGGUAGACAAUUG 


2024 


GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGCCACAAGGUAA 


2026 


GCAAGGAGGUUAGAGAAAU 


2027 
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BID 



BIRC2 



BIRC3 



BIRC4 



BIRC5 



BRCA1 



CARD4 



CASP10 



CAAGGAGGUUAGAGAAAUA 


2028 


"UCUUACGACUGUUACGUUA 


2029 








GAAGACAUCAUCCGGAAUA 


2030 


GAACAGCGUUCCUAGAGAA 


2031 


f^GAAAUGGGAUGGACUGAAC 


2032 


, ACGAUGAGCUGCAGACUGA 


2033 








GAAAGAAGCCUGCAUAUAA 


f 2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA 


2036 


GAAUACACCUGUGGUUAAA 


2037 








GGAGAUGCCUGCCAUUAAA 


r 2038 


UCAAUGAUCUUGUGUUAGA 


2039 


GAAAGAACAUGUAAAGUGU 


2040 


GAAGAAAGAACAUGUAAAG 


2041 








GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGCUAACUGAUUG 


2043 


GAGGAACCCUGCCAUGUAU 


2044 


GCACGGAUCUUUACUUUUG 


2045 




GGCGUAAGAUGAUGGAUUU ^ 






2046 


iGCAAAGGAAACCAACAAUA 


2047 


GCACAAAGCCAUUCUAAGU ' 


2048 


CAAAGGAAACCAACAAUAA 


2049 








CCAUAGAGCUUCAUAAAUA 


2050 


GAAGAGAACUUAUCUAGUG 


2051 


GAAGUGGGCUCCAGUAUUA 


2052 


GCAAGAUGCUGAUUCAUUA 


2063 


CCAUAGAGCUUCAUAAAUA ' 


2054 








G^^AGUUAAUGUCMGGAA 


2055 


GAGCAACACUGGCAUAACA \ 


2056 


UAACAGAGAUUUGCCUAAA s 


2057 


GCGAAGAGCUGACCAAAUA " ! 


2058 








CAAAGGGUUUCUCUGUUUA ^ 


2059 


GAAAUGACCUCCCUAAGUU 


2060 
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CASP2 



CDKN1A 



CFLAR 



CLK2 



CLSPN 



CSNK2A1 



CTNNB1 



CXCR4 



GAAGGCAGCUGGUAUAUUC 


2061 


GACAUGAUCUUCCUUCUGA 


' 2062 


GCACUCUUCUGUUCCCUUA 


2063 








GUAUUAAACUCUCGUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCCUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 








GAACAAGGAGUCAGACAUU 


2068 


AAACUAGGCGGUUGAAUGA 


2069 


GAUGGAACUUCGACUUUGU 


2070 


t GUAAACAGAUGGCACUUUG 


2071 








GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA 


2073 


GAGCAUACCUGAAGAGAGA 


' 2074 


GCUAUGAAGUCCAGAAAUU 


2075 




GUGAAUAUGUGAAAUAGUG ^ 


f ' 2076 






AAAGCAUGCUAGAGUAUGA 


2077 


UUAAGAAUGUGGAGAAGUA 


2078 


GAUAAGAAGCUGACAGAUA 


2079 




GGACGUAAUUGAUGAAGl^^^^^ 






2080 


GCAGAUGGGUUCUUAAAUG 


2081 


CAAAUGAGGUU6AGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 








GAUCCACGUUUCAAUGAUA 


2084 


GCAUUUAGGUGGAGACUUC 


2085 


GAUGUACGAUUAUAGUUUG 


2086 


UGAAUUAGAUCCACGUUUC 


2087 








'(3ca'caagaauggaucagaa 


2088 


GCUGAAACAUGCAGUUGUA 


2089 


GUACGUACCAUGCAGAAUA ' 


2090 


GAACUUGCAUUGUGAUUGG 


2091 








GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCCAGAGCGUGUA 


2093 
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CXCR6 



DAXX 



GAS41 



GTSE1 



HDAC3 



HDAC5 



HEC 



HIST1H2 



GUUCUUAGUUGCUGUAUGU 


1 2094 


CAUGAUGGUUGGCCUUAUC 


1 2095 










GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


GGGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUCUCCUUGA 


2099 










CUACAGAUCUCCAAUGAAA 


2100 


GCUAGAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA 


2102 


GGAGUUGGAUCUCUCAGAA 


2103 








guaguXagcuaaacugaaa 


2104 


GACAAUAUGUUCAAGAGAA 


2105 


GACAACAUCUCGUCAGCUA 


2106 


UAUAUGAUGUGUCCA6UAA 


2107 








CAAAGAAGCUCACUUACUG 1 2108 


GAACAGCCCUAAAGUGGUU 


2109 


GAACAUGGAUGAGCCUAAG 


2110 


GGGCAAAGCUAAAUCAAGU 


2111 








GGAAAGCGAUGUGGAGAUU 


2112 


CCAAGACCGUGGCCUAUUU 


2113 


AAAGCGAUGUGGAGAUUUA 


2114 


GUGAGGAGCUUCCCUAUAG 


2115 






^^^^^^^^^ 


i GAAUUCCUCUUGUCGAAGU 


2116 


1 GUUAUUAGCACCUUUAAGA 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GGAUAUGGAUUUCAGUUAA 


2120 


GGAAGUCGGUGCCUUGGUU 1 


2121 


GGAAGGAGAGGACUGGUUU 


2122 








GCAGAUACUUGCACGGUUU 


2123 


GAGUAGAACUAGAAUGUGA 


2124 


GCGAAUAAAUCAUGAAAGA 


2125 


GAAGAUGGAAUUAUGCAUA 


2126 
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AA 



LMNB1 



LMNB2 



MYB 



MYT1 



NFKBIB 



NFKBIA 



NFKBIE 



NUMA1 



GGCAAUGCGUCUCGCGAUA 


1 2127 


GAUCCGCAAUGAUGAGGAA 


2128 


GCAAUGCGUCUCGCGAUAA - 


2129 


GAGGAACUCAAUAAGCUUU 


2130 








AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


1" 2132 


GAAGGAAUCUGAUCUUAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 








GGAGGUUCAUUGAGAAUUG 


2134 


GGCAAUAGCUCACCGUUUA 


2135 


CAAAUACGCUUAGCUGUGU 


2136 


GGAGAUCGCCUACAAGUUC 


2137 






^^^^^^^^^^ 


GCAGAAACACUCCAAUUUA 


2138 


GUAAAUAGGUGAAUGCAUU 


2139 


GCACUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG 


1 2141 








GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


2144 


CAACUUCUCUCCUGAACUU 


2145 








GGACACGGCACUGCACUUG 


2146 


GCACUUGGCUGUGAUUCAU 


^ 2148 


GAGACGAGGGCGAUGAAUA 


2149 


CAUGAACCCUUGCUGGAUU 


'2150 








GAACAUGGACUUGUAUAUU " 


2151 


GAUGUGGGGUGAAAAGUUA 


2152 


GGACGAGAAAGAUCAUUGA ' 


2153 


AGGACGAGCUGCCCUAUGA 


2154 












GAAGGGAAGUUUCAGUAAC 


2155 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAC 


2157 


GAACCAACCACUCAUGGAA 


2158 








1 GGGAACAGUUUGAAUAUAA 


2159 1 
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[CGAGAAGGAUGCACAGAUA " j- 2161^7 
jGCAAGAGGCUGAGAGGAAA" j" "'2162' 

GAAGACAAAUGAA^CU/S^ _ j^I^t_^63'J^ 
GAUAAAGACUGCUG UUAGA "_ r_l_ 2164__ 
G AGGAGAGCUCU AAUAUUA" ~"~ 1 " T165 
GAGGAAGCCUG A UpjiW^GA " - -| 











GAAAGAGCAUGAUGACAUA 




GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


AAAGAAGGCUGUACCGUUA 


2170 



PARVA 




^^^^^^^^ 




CUACAUGUCUUUGCUCUUA 


i 2171 




GCUAAGUCCUGUAAGAAUA 


j 2172 




CAAAGGCAAUGUACUGUUU 


1 2173 




GAACAAUGGUGGAUCCAAA 


1 2174 






PIK3CG 








AAGUUCAGCUUCUCUAUUA 


2175 




GAAGAAAUCUCUGAUGGAU 


2176 




GAACACCUUUACUCUAUAA 


2177 


1 GCAUGGAGCUGGAG AACUA 


'2178 






PRKDC 








GAUGAAAGCUCUAAAGAUG 


2T79 




GAAAGGAGGUUCUAAACUA 


2180 




GGAAGAAGCUCAUUUGAUU 


2181 




GCAAAGAGGUGGCAGUUAA 


2182 






RASA1 








GGAAGAAGAUCCACAUGAA 


2183 




GAACAUACUUUCAGAGCUU 


2184 




GAACAAUCUUUGCUGUAUA 
UAACAG^AACUGCUUCAACA " 


"2185 
'21 86 


SLC9A1 




^^^^^ 




GAAGAGAUCCACACACAGU 


2187 




UCAAUGAGCUGCUGCACAU 


2188 




GAAGAUAGGUUUCCAUGUG 


2189" 




GAAUUACCCUUCCUCAUCU ! 


2190 






TEGT 


CUACAGAGCUUCAGUGUGA ' ^ 








2191 




GAACAUAUUUGAUCGAAAG i 


2192 
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TERT 



T0P1 



T0P2A 



T0P3A 



T0P3B 



WEE1 



! GCAUUGAUCUCUUCUUAGA 


2194 








feGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGCUGACGUGGAAGAUGA 


2197 


GAACGGGCCUGGAACCAUA 


2198 



TNFRSF6 I 







! gauacuaacugcucucXga 


2199 


, gaaagaauggugucaauga 


2200 


UCAAUAAUGUGCCAUGUAA 


2201 


ucaugaaucuccaaccuua 


2202 


GAUGUUGACUUGAGUAAAU 


2203 







GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA ^ 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAUAAGUGGAAAGAAG 


r" 2207 



i GAAAGAGUCCAUCAGAUUU 



! CAAACUACAUUGGCAUUUA 



AAACAGACAUGGAUGGAUA 



CGAAAGGAAUGGUUAACUA 



2208 



2209 



2210 



2211 



CCAGAAAUCUUCCACAGAA 



GAAACUAUCUGGAUGUGUA 



CCACAAAGAUGGUAUCGUA 



GGAAAUGGCUGUGGUAACA 



2212 



2213 



2214 



2215 



! GAGACAAGAUGAAGACUGU 



GCACAUGGGCUGCGUCUUU 



I CCAGUGCGCUUCAAGAUGA 
rGAAGAUCUGCUUUGAGGUU 




1 GGUAUUGGCUUGUGAAUUU 



i gcagaacaauuacgaauag 

FguacauagcuguOugaaau 



2220 



2221 



2222 



i GCUGUAAACUUGUAGCAUU 



2223 
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Please replace the paragraph on page 146, spanning lines 8-12, and replace it with the 
following amended paragraph: 

the interleukin 4 recq)tor gene 

(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU, 
SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA, 
SEQ. ID NO. [[2225]] 2412: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 

Please replace the paragraph on page 148, spanning lines 8-19, and replace it with the 
following amended paragraph: 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
SEQ. ID NO. [[2254]] 2413 : GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. [[2255]] 2414 : GAAGCUGCGUCCCAACAUC, 
SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA, 
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

Please replace the paragraph on page 150, spaiming lines 22-26, and replace it with the 
following amended paragraph: 



N0S3 

(SEQ. ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
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SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
SEQ. ID NO. 2310: GGAAGAAGACCUUUAAAGA, 
SEQ. ID NO. [[2309]] 2415 : GCACAAGAGUUAUAAGAUC), 



Please replace the paragraph on page 150, spanning lines 28-32, and replace it with the 
following amended paragraph: 

ARH 

(SEQ. ID NO. [[2310]] 2416: CGAUACAGCUUGGCACUUU, 
SEQ. ID NO. 2311: GAGAAGCGCUGCCCUGUGA, 

SEQ. ID NO. 2312: GAAUCAUGCUGUUCUCUUU, 
SEQ. ID NO. 2313: GGAGUAACCGGACACCUUA), 
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Applicants have amended the specification by assigning SEQ. ID NOS. to 
sequences lacking them, and changing sequence ID numbers to correct instances where 
more than one sequence was assigned to a single SEQ. ID NO. Applicants have also 
amended the specification to correct four typographical errors consisting of dual 'T" 
residues at the end of each of SEQ. ID NOS. 1413-1416, The amendments add no new 
rriatter. Applicants respectfiilly request entry of the amendments. Substitute 
specification pages reflecting the amendments would be provided to the Examiner upon 
the Examiner's request. 

No fee is required in connection with the filing of this Preliminary Amendment. 
If any fee is deemed necessary, authorization is hereby given to charge the amount of any 
such fee to Deposit Account No. 1 1-0171. 



REMARKS 




Respectfially submitted. 



Tor Smeland 
Registration No. : 43 , 1 3 1 
Attorney for Applicants 



Kalow & Springut LLP 
Telephone No.: (212)813-1600 
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Serial No.: 10/714,333 
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488 Madison Avenue, 19th Floor 
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June 30, 2004 



Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 



AMENDMENT DIRECTING ENTRY OF SEQUENCE LISTING 



Sir: 



This amendment directs entry into this application of the sequence listing filed by 
Express Mail on June 29, 2004. 



Certificate of Transmission Under 37 C.F.R. 1.8 

I hereby certify that this correspondence is being facsimile transmitted to the United States Patent and Traifemark OflBce (Fax. No. 
(703) 746-4060) on the date shown below. 

(Sipature) (Printed Name of Poson Signing Certificate) (Date) 
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Amendments to the Specification; 

Please insert the sequence listing filed on June 29, 2004, after page 159 and before page 



160. 
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REMARKS 

Applicants herein request entry of the sequence listing filed on June 29, 2004 into 
the instant application. In response to a Notice to Comply with Requirements for Patent 
Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures 
(Filing Date Granted) mailed February 12, 2004, Applicants filed a sequence listing on 
CD-ROM, in lieu of a paper sequence Usting, on June 29, 2004 by Post Office to 
Addressee Express Mail, addressed to Box Missing Parts, with a petition and fee for a 
three-month extension of time. Applicants herein request entry of the sequence listing 
filed, which listing is a text file entitled "13499US.txf ' on the disk entitled "FILE COPY 
- SEQUENCE LISTING COPY 1 OF 3 (PAPER COPY)". 

Support for the amendment can be found in the application as filed, for example, 
in Table_12.txt, Table_13.txt, Table_14.txt, and Table_15.txt, which are tables related to 
nucleotide sequences that were filed on CD-ROM on the application's filmg date, in 
duplicate, on a compact disk labeled "DOCKET 13499, PATENT APPLICATION, 
DISK 1 OF 1, COPY 1 OF 2." The amendment adds no new matter. Applicants 
respectfiilly request entry of the amendment. 

Conclusion 

No fee is believed to be due with respect to the filing of this amendment If any 
additional fees are due, or an overpayment has been made, please charge, or credit, our 
Deposit Account No. 11-0171 for such sum. 

If the Examiner has any questions regarding the present application, the Examiner 
is cordially invited to contact Applicant's attorney at the telephone number provided 
below. 

I 
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Respectfully submitted. 




Tor Smeland 
Registration No.: 43,131 
Attorney for Applicant 



Kalow & Springut LLP 
Telephone No.: (212)813-1600 



f 

ITEM 4 
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PATENT DOCKET 13499USC2 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant: KHVOROY A et al. Examiner: To be assigned 

Serial No.: 10/714,333 Group Art Unit: 1646 

Filed: November 14, 2003 

For: Functional and Hyperfunctional siRNA 

Customer No.: 23719 

Kalow & Springut LLP 

488 Madison Avenue, 19th Floor 

New York, New York 1 0022 

April 21, 2005 

Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 

SUPPLEMENTAL PRELIMINARY AMENDMENT 

Dear Sir: 

Prior to examination on the merits, please enter the following amendments in the 
above-identified application. 



Certificate of Transmission Under 37 C.F.R. 1.8 

I hereby certify that this correspondence is being facsimile transmitted to the United States Patent and Trademark Office (Fax. No. 
(703) 872-9306) on the date shown below. 

(Sl^naturej" (PrintedNameof Person Signing Certificate) (Date) 
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Amendments to the Claims: 

This listing will replace all prior versions, and listings, of claims in the application: 
Listing of Claims; 

1 . (original) A method for selecting siRNA comprising selecting an siRNA 
molecule of 19 - 25 nucleoside bases, said method comprising: 

(a) selecting a target gene; 

(b) measuring the functionality of sequences of 19 - 25 nucleotides in length that 
are substantially complementary to a stretch of nucleotides of the target 
sequence, wherein said functionality is dependent upon non-target specific 
criteria. 

2. (original) The method according to claim 1 wherein said functionality is 
determined by applying one of the following formulas: 

Formula I = .(GC/3) +(AUi5-i9) -<Tm2o«c)*3 -(Gi3)*3 -(€,9) +(A,9)*2 +(A3) 
+(Uio)+(Ai4)-(U5)-(Aii); 

Formula II = -(GC/3) -<AUi5.i9)*3 -(Gi3)*3 -(C19) +(A,9)*2 +(A3); 
Formula III = -(GC/3) +(AUi5-i9) -(Tm2ox)*3; 

Formula IV = -(GC/2)-f(AUi5.i9)/2-(Tm2ooc)*2 -(Gi3)*3 -(C19) +(A,9)*2 +(A3) 
+(Uio)+(Ai4)-(U5)"(AiO; 

Formula V = -(Gi3)*3 -(C19) +(A,9)*2 +(A3) + (U,o)+(A,4) -(U5) -(An); 



Formula VI = -(Gi3)*3 -<Ci9) +(Ai9)*2 +(A3); 
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Formula VII = -(GC/2) +(AU,5.i9)/2 -( Trnzo^c)*! -(G,3)*3 -(C19) +(A,9)*3 
+(A3)*3 +(U,o)/2+(A,4)/2 -(U5)/2 -(An)/2; 



wherein in Formulas I - VII: 



AU15-19 = 0-5 depending on the number of A or U bases on the sense 

strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 

value is 0; 

C|9 = 1 if C is the base at position 19 of the sense strand, otherwise its 

value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 2ox= 1 if the Tm is greater than 20°C; 

A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

All = 1 if A is the base at position 1 1 on the sense strand, otherwise its 
value is 0; 

Ai4 = 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Uio = 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 



or. 
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Formula VIII: (-14)*Gi3-13*Ai-12*U7-ll*U2-10*An-10*U4-10*C3-10*C5-10*C6- 

9*Aio-9*U9-9*Ci8-8*Gio-7*U,-7*Ui6-7*C,7-7*C,9+7*Ui7+8*A2+8*i^ 
+8*A5+8*C4+9*G8+10*A7+10*Ui8+ll*Ai9+ll*C9+15*Gi+ 
18*A3+19*Uio-Tm -3* (GCtotai) - 6*(GC,5-i9)- 30*X; and 

Formula IX: (14.1)*A3+(14.9)*A6-^(17.6)*A,3+(24.7)*A,9+(14,2)*U,o+(10.5)* 
C9+(23.9)*G,+(16.3)*G2+(-12.3)*Ai,+(-19.3)*U,+(-12.1)*U2+ 
(-n)*U3+(-15.2)*Ui5+(-llJ)*U,6+(-11.8)*C3+(-17.4)*C6+(-^ 
(-13.7)*G,3+(-25.9)*G,r-Tm-3* (GCtotai)-6*(GCi5-i9)- 30*X 

wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 == 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4= 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
A5= 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
A6= 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7= 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
Ai I = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
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C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
€9=1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 



Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 =1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9 = 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 == 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 



GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand 
within positions 15 -18 if the sense strand is only 18 base pairs in length; 
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GCtotai the number of G and C bases in the sense strand; 

Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 

otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a row. 

3. (original) A method of gene-silencing comprising selecting an siRNA according 
to claim 2 and introducing it into a cell. 

4. (original) The method according to claim 3 wherein said introducing is by 
allowing passive uptake of the siRNA. 

5. (original) The method according to claim 3, wherein said introducing is through 
the use of a vector. 

6. (original) A method for developing an siRNA algorithm for selecting siRNA, 
said method comprising: 

(a) selecting a set of siRNA; 

(b) measuring the gene silencing ability of each siRNA from said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of the 
total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at a 
particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 



7. 



(original) A method of selecting an siRNA with improved functionality, said 
method comprising using the algorithm of claim 6. 
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8. (original) A method of selecting hyperfunctional siRNA, said method comprising 
using at least one functional siRNA, wherein at least one said functional siRNA 
has been selected according to the method of claim 7 and measuring the silencing 
ability of said at least one functional siRNA, wherein silencing ability is measured 
at a concentration of less than. 1 nanomolar siRNA. 

9. -18. (canceled) 

19. (original) A kit, wherein said kit is comprised of at least two siRNA, wherein 

said at least two siRNA comprise a first optimized siRNA and a second optimized 
siRNA, wherein said first optimized siRNA and said second optimized siRNA are 
optimized according to one of the following formulas: 

Formula I = -(GC/3) +(AU,5.i9) -(Tm2o°c)*3 ~(Gi3)*3 ~(Ci9) +(A,9)*2 ^(Aa) 
+(Uio)+(Ai4)-(U5)-KAn); 

Formula II - -(GC/3) ^AUi5-i9)*3 AGnT'i -(C19) +(Ai9)*2 +(A3); 
Formula III = -(GC/3) +(AUi5.i9) -(Tm2ox)*3; 

Formula IV = -(GC/2)+(AU,5-i9)/2-(Tm2ox)*2 -(Gi3)*3 -(C19) +(Ai9)*2 ^(Ab) 
+(Uio)+(Ai4)-(U5)^An); 

Formula V = -(Gi3)*3 -(C,9) +(Ai9)*2 +(A3) + (Uio)+(Ai4) -(U5) -(An); 
Formula VI = -(Gi3)*3 -(C19) +(Ai9)*2 +(A3); 
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Formula VII = -(GC/2) +(AUi5.i9)/2 -( Tm2ox)*l -<Gi3)*3 ^0,9) +(Ai9)*3 +(A3)*3 
+(Uio)/2+(A,4)/2 -(U5)/2 -(A, ,)/2; 



wherein in Formulas I - VII: 



AU15-19 = 0-5 depending on the number of A or U bases on the sense 
strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 
value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 
value is 0; 

GC the number of G and C bases in the entire sense strand; 
Tm 20X = 1 if the Tm is greater than 20^C; 

A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

All = 1 if A is the base at position 1 1 on the sense strand, otherwise its 
value is 0; 

Ai4 = 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Uio= 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 
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Formula VIII: (-14)*Gi3-13*Arl2*U7-li*U2-10*Au-10*U4-10*C3"10*C5-10*^^^ 
9*U9-9*Ci8-8*Gio-7*Ui-7*Ui6-7*Ci7-7*Ci9+7*U,7+8*A2+8*A4 
+8*A5+8*C4+9*G8+10*A7-M0*Ui8+ll*Ai9+ ll*C9+15*Gi+ 
18*A3+19*Uio-Tm -3* (GCtotai) - 6*(GCi5^i9)- 30*X; and 

Formula IX: (14.1)*A3+(14.9)*A6+(17.6)*A,3+(24.7)*Ai9+(14.2)*Uio+(10.5)* 
C9+(23.9)*Gi+(16.3)*G2+(-12.3)*An+(-19.3)*Ui+(-12.1)*U2+ 
(-ll)*U3+(-15.2)*Ui5+(-llJ)*U,6+(-11.8)*C3+(-17.4)*C6+(-10.5)*^^^ (- 
13.7)*Gi3+(-25.9)*Gi9-Tm-3* (GQotai) - 6*(GCi5-i9)- 30*X 

wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4= 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
Ai 1 = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
Ai9= 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
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C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9 = 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand or 

within positions 15 -18 if the sense strand is only 18 base pairs in length; 
GCtotai = the number of G and C bases in the sense strand; 
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Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, otherwise 
its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a row. 
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Claims 1-19 are pending in the application. This Supplemental Preliminary 
Amendment cancels claims 9-18 without disclaimer. Applicants reserve the right to file 
one or more divisional or continuation applications based on the canceled claims. The 
amendment adds no new matter. Applicants respectfully request entry of this 
amendment. 

No fee is required in connection with the filing of this Preliminary Amendment. 
If any fee is deemed necessary, or overpayment has been made, please charge, or credit, 
Deposit Account No. 11-0171. 
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Respectfully submitted, 




Tor Smeland 
Registration No.: 43,131 
Attorney for Applicants 
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Functional and Hyperfunctional siRNA 

5 Cross Reference to Related AopBcations 

This application claims the benefit of the filing date of U.S. Provisional 
Application Serial No. 60/426,137, filed November 14, 2002, entitled "Combinatorial 
Pooling Approach for siRNA hiduced Gene Silencing and Methods for Selecting 
siRNA," and U.S. Provisional Application Serial No. 60/502,050, filed September 10, 
10 2003, entitled "Methods for Selecting siRNA," the entire disclosures of which are 
hereby incorporated by reference into the present disclosure. 

Field of Invention 

The present invention relates to RNA interference ("RNAi"). 
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Background of the Invention 

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") could be used to inhibit protein expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
20 commo-cial entities are currently investing considerable resources in developing 
therapies based on this technology. 

Double stranded JEINA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refere to RNA guided DNA or 
25 histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced 
transcriptional attenuation. 

It is generally considered that the major mechanism of RNA induced silencing 
(RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
30 attempts to use RNAi in mammalian cells focused on the use of long strands of 
dsRNA. However, these attempts to induce RNAi met with limited success, due in 
part to tile induction of the interferon response, which results m a general, as opposed 
to a target-specific, inhibition of protein syntiiesis. Thus, long dsRNA is not a viable 
option for RNAi in mammalian systems. 



More recently it has been shown that when short (1 8-30 bp) RNA duplexes are 
introduced into mammalian cells in culture, sequence-specific inhibition of target 
mRNA can be realized without inducing an interferon response. Certain of these 

5 short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically 
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the 
cell. A description of the mechanisms for siRNA activity, as well as some of its 
applications are described in Provost et al, Ribonuclease Activity and RNA Binding of 
Recombinant Human Dicer. E.M.B.O. J., 2002 Nov. 1; 21(21): 5864-587^4; Tabara ef 

10 al. The dsRNA Binding Protein RDE-4 Interacts with RDE-l. DCR-l arid a DexH- 
boxHelicase to Direct RNAi in C. elegans. Cell 2002, June 28;l69(7):861-71; Ketting 
et al, Dicer Functions in RNA Interferertceand in Synthesis of Small RNA Involved in 
Developmental Timing in C. elegans; Martinez et al, Single-Stranded Antisense 
SiRNAs Guide Target RNA Cleavage in RNAi, Cell 2002, Sept. 6; 110(5):563; 

1 5 Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex. 
Science 2002, 297:2056. 

From a mechanistic perspective, introduction of long double stranded RNA 
into plants and invertebrate cells is broken down into siRNA by a Type III 
20 endonuclease known as Dicer. Sharp, RNA interference— 2001 . Genes Dev. 2001, 
15:485. Dicer, a ribonuclease-IIMike enzyme, processes the dsRNA into 19-23 base 
pair short interfering RNAs with characteristic two base 3' overhangs. Bernstein, 
Caudy, Hammond, & Harmon, Role for a bidentate ribonuclease in the initiation step 
of RNA interference. Nature 2001, 409:363. The siRNAs are then incorporated into 

25 an RNA-induced silencing complex (RISC) where one or more helicases unwind the 
siRNA duplex, enabling the complementary antisense strand to guide target 
recognition. Nykanen, Haley, &. Zamore, ATP requirements and small interfering 
RNA structure in the RNA interference pathway, Cell 2001 , 1 07:309. Upon binding to 
the appropriate target mRNA, one or more endonucleases within the RISC cleaves the 

30 target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is 
mediated by 21- and 22-nucleotide RNAs. Genes Dev 2001 ,15:188, Figure 1. 

The interference effect can be long lasting and may be detectable' afta- many 
cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielow, M. et al 



(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using 
small interfering RNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can 
specifically knock down one type of transcript, while not affecting closely related 
mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene 
5 expression and studying gene function and drug target validation. Moreover, siRNAs 
are potentially useful as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

1 0 Successful siRNA-dependent gene silencing depends on a number of factors. 

One of the most contentious issues in RNAi is the question of the necessity of siRNA 
design, i.e., considering the sequence of the siRKA used. Early work in C. elegans 
and plants circumvented tlie issue of design by introducing long dsRNA (see, for 
instance. Fire, A. et al. (1998) Nature 391 :806-81 1). In this primitive organism, long 

15 dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse 

population of duplexes that can potentially cover the entire transcript While some 
fi-action of these molecules are non-fiinctional (i.e. induce little or no silencing) one or 
more have the potential to be highly functional, thereby silencing the gene of interest 
and alleviating the need for siRNA design. Unfortunately, due to the interferon 

20 response, this same approach is unavailable for mammalian systems. While this effect 
can be circumvented by bypassing the Dicer cleavage step and directly introducing 
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non- 
functional or semi-functional. - ^ 

25 A number of researches have expressed the view that siRNA design is not a 

crucial element of RNAi. On the other hand, others in the field have begun to explore 
the possibility that RNAi can be made more efficient by paying attention to the design, 
of the siRNA. Unfortunately, none of the reported methods have provided a 
satisfactory scheme for reliably selecting siRNA with acceptable levels of 

30 functionality. Accordingly, there is a need to develop rational criteria by which to 
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of functionality, as well as to identify siRNAs that are 
hyperfanctional. 



Summary of the Invention 

The present invention is directed to increasing the efficiency of RNAi, 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRN A efficacy. 

5 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger RNA. 

10 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or-fffore siRNAs that are optimized 
according to a formula (or algorithm) selected from: 
Formula I 

1 5 Relative fimctionality of siRNA= -(GC/3) +(AUi5-i9) -(Tm2o«c)*3 -(Gi3)*3 -(C19) 
+(A,9)*2 +(A3) +(Uio)+(Ai4) -(Us) -(An) 



Formula II 

Relative functionality of siRNA= -(GC/3) -(AUi5-i9)*3 -{Gn)*3 -(C,9) +(A,9)*2 
20 +(A3) 

Formula III 

Relative functionality of siRNA= -(GC/3) +(AUi5-i9) -(Tm2o-c)*3 - 

25 Formula IV 

Relative fimctionality of siRNA= 

-GC/2+( AU,5-i9)/2-(Tm2o-c)*2 -(Gi3)*3 -(C19) +(Ai9)*2 +(A3) +(U,o)+(Ai4) -(U5) - 
(A„) 

30 Formula V 

Relative functionality of siRNA=-(Gi3)*3 -(C,,) +(A,9)*2 +(A3) + (U,o)+(Ai4) -(Us) 
-(An) 

Formula VI 



Relative ftinctionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) 



Formula VII 

Relative functionality of siRNA=-(GC/2) +(AUi5.i9)/2 -( Tniao^c)*! -{Gi3)*3 ^€19) 
5 +(Ai9)*3 +(A3)*3 +(Uio)/2+(A,4)/2 -(U5)/2 -<An)/2 

wherein in Formulas I - VII: 

Tm 2o»c=l if the Tm is greater than 20°C; 
10 A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 

is 0; 

AU15.J9 = 0-5 depending on the number of A'-or U bases on the sense strand 

at 

positions 15-19; 

15 Gi3 = 1 if G is the base at position 1 3 on the sense strand, otlierwise its value is 

0; 

Ci9 = 1 if C is the base at position 19 of the s^ise strand, otherwise its value is 

0; 

GC= the number of G and C bases in the entire sense strand; 
20 A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 

Ai 1= 1 if A is the base at position 1 1 on the sense strand, otherwise its value is 

0; 

Ai4= 1 if A is the base at position 14 on the sense strand^ otherwise its value is 

0; 

25 Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

U5= 1 if U is the base at position 5 on the sense strand, othemise its value is 0; 

or 

30 Formula VIII Relative functionality of siRNA = 

(-14)*Gi3-13*Arl2*U7-n*U2-10*Aii-10*U4-10*C3-10*C5"^ 
9*Aio-9*U9-9*Ci8-8*G,o-7*U,-7*Ui6-7*Ci7-7*Ci9 
+7*Ui7+8*A2+8*A4+8*A5+8*C4+9*G8+10*A7+10*Ui8+ll*Ai9+ 



11*C9+15*G,+ 18*A3+19*Uio-Tm-3* (GCtotai)-6*(GCi5-i9> 
30*X; and 



Fonnula DC Relative functionality of siRNA = 
5 (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*Ui^ 

C9+(23.9)*Gi+(163)*G2+(-123)*Aii+(-193)*Ui+(-12.1)*U2+ 
(.ll)*U3+(-15.2)*Ui5+(-113)*Ui6+(-11.8)*C3+(-17.4)*Q^^^ 
ia5)*C7+ (-13.7)*Gi3+(-25.9)*Gi9-Tm"3* (GCtotai) - 6*(GCi5-i9)- 
30*X 

10 wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, bttierwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
15 As = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
20 Ai3 = 1 if A is the base atT>bsition 13 of the sense strand, otherwise its value is 0; 

Ai9 = 1 if A is the base at position 1 9 9f the sense strand, otherwise if anotlier base i 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
25 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 

Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 == 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of &e sense strand, otherwise its value is 0; 
30 Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

C18 = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 1 9 of the sense strand, otherwise if another base 
present or the sense strand is only 18 base pairs in length, its value is 0; 



Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
62= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
5 Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
10 U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 

U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 

U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 

U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
15 Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 

Ui5 = 1 if U is the base at position 15 on the sense strand, otiierwise its value is 0; 

U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Ui7 == 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 
20 r^' 

GCi5_i9 = the number of G and C bases within positions 15 — 19 of the sense strand 
or within positions 15 -18 if the sense strand is only 18 base pairs in length; 

GCtotai= the number of G and Chases in the sense strand; .-jr:- 

Tm =100 if the targeting site contains an inverted repeat longer tiian 4 base pairs, 
25 otiierwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a row. 

According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
30 to one of the formulas above. Preferably the kit contains at least two optimized 
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 



The ability to use the above algorithms, which are not sequence or species 
specific, allows for the cost-effective selection of optimized siRNAs for specific 
target sequences. Accordingly, there will be both greater efficiency and reliability in 
the use of siRNA technologies. 

According to a fourth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting functional and hyperfunctional 
siRNAs for a given sequence. The method comprises: 

(a) . selecting a set of siRNAs; 

(b) measuring the gene silencing ability of each siRNA fi-om said set; 

(c) detemiining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 
the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at a 
particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 

According to this embodiment, preferably the set of siRNAs comprises at least 
90 siRNAs fi-om at least one gene, more preferably at least 180 siRNAs fi-om at least 
two different genes, and most preferably at least 270 and 360 siRNAs firom at least 
three and four different genes, respectively. Additionally, in step (d) flie 
determination is made with preferably at least two, more preferably at least three, 
even more preferably at least four, and most preferably all of the variables. The 
resulting algorithm is not target sequence specific. 

In a fifth embodiment, the present invention provides rationally designed 
siRNAs identified using the formulas above. 



In a sixth embodiment, the present invention is directed to hyperfbnctional 

siRNA. 



For a better understanding of the present invention together with other and 
further advantages and embodiments, reference is made to the following description 
taken in conjunction with the examples, the scope of which is set forth in the 
appended claims. 
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Brief Description of the Figures 

Figure 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding^ the duplex is 
unwoxmd, and the relevant target is identified, cleaved, and released. 

10 

Figure 2 is a representation of the functionality of two hundred and seventy siRNA 
duplexes that were generated to target human cyclophilin, human diazepam-binding 
inliibitor (DB), and firefly luciferase. 

1 5 Figure 3a is a representation of the silencing effect of 30 siRNAs in three different 
cells lines, HEK293, DU145, and Hela. Figure 3b shows the frequency of different 
functional groups (>95% silencing (black), >80% silencing (gray), >50% silencing 
(dark gray), and <50% silencing (white)) based on GC content. In cases where a given 
bar is absent from a particular GC percentage, no siRNA were identified for that 

20 particular group. Figure 3c shows the frequency of different functional groups based 
on melting temperature (Tm). Again, each group has four different divisions: >95% 
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing. 

Figure 4 is a representation of a statistical analysis that revealed correlations between 
25 silencing and five sequence-related properties of siRNA: (A) an A at position 19 of 
the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of 
the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a 
base other than C at position 19 of the sense strand. All variables were correlated with 
siRNA silencing of firefly luciferase and human cyclophilin. SiRNAs satisfying the 
30 criterion are grouped on the left (Selected) while those that do not, are grouped on the 
right (Eliminated). Y-axis is "% Silencing of Control.'' Each position on the X-axis 
represents a unique siRNA. 



Figures 5 A and 5 B are representations of firefly luciferase and cyclophilin siRNA 
panels sorted according to functionality and predicted values using Formula VIII. The 
siRNA found within the circle represent those that have Formula VIII values 
(SMARTscores™) above zero. SiRNA outside the indicated area have calculated 
Formula VIII values that are below zero. Y-axis is "Expression (% Control)." Each 
position on the X-axis represents a unique siRNA. 

Figure 6A is a representation of the average intemal stability profile (AISP) derived 
fi-om 270 siRNAs taken firom three separate genes (cyclophihn B, DBI and firefly 
luciferase). Graphs represent AISP values of highly functional, functional, and non- 
functional siRNA. Figure 6B is a comparison between the AISP of naturally derived 
GFP siRNA (filled squares) and the AISP of siRNA fi-om cyclophilin B,-DBI, and 
luciferase having >90% silencing properties (no fill) for the antisense strand. "DO" is 
the symbol for AG, fi^ee energy. 

Figure 7 is a histogram showing the differences in duplex functionality upon 
introduction of basepair mismatches. The X-axis shows the mismatch introduced inot 
the siRNA and the position it is introduced (e.g., 8C->A reveals that position 8 (which 
normally has a C) has been changed to an A). The Y-axis is "% Silencing 
(Normalized to Control)." - 

Figure 8a is histogram that shows the effects of S'sense and antisense strand 
- modification with 2'-0-methylation on functionality. Figure 8b is an expression 
profile showing a comparison of sense strand off-target effects for IGFlR-3 and 2'-0- 
methyl IGFlR-3. Sense strand off-targets (lower white box) are not induced when the 
5' end of the sense strand is modified with 2'-0-methyl groups (top white box). 

Figure 9 shows a graph of SMARTscores™ versus RNAi silencing values for more 
than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
bar represent those siRNA that have desirable SMARTscores™. 

Figures lOA - E compare the RNAi of five different genes (SEAP, DBI, PLK, 
Firefly Luciferase, and Renila Luciferase) by varying numbers of randomly selected 
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siRNA and four rationally designed (SMART-selected) siRNA chosen using the 
algorithm described in Formula VIIL hi addition, RNAi induced by a pool of the four 
SMART-selected siRNA is reported at two different concentrations (100 and 40QnM). 
lOF is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and 
5 a pool of SMART selected EGFR siRNA (Pool 2). Pool 1, SI— S4 and Pool 2 SI— 
S4 represent the individual members that made up each respective pool. Note that 
numbers for random siRNAs represent the position of the 5' end of the sense strand of 
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is 
the percent expression of the control. 

10 

Figure 11 shows the Western blot results from cells treated with siRNA directed 
against twelve different genes involved in the clathrin-dependent endocytosis* 
pathway (CHC, Dynll, CALM, CLCa, CLCb, Epsl5, EpslSR, Rab5a, Rab5b, RabSc, 
P2 subunit of AP-2 and EEA.l). SiRNA were selected using Formula VIII. "Pool" 
15 represents a mixture of duplexes 1-4. Total concentration of each siRNA in the pool 
is 25 nM. Total concentration = 4 x 25 = 100 nM. 

Figure 12 is a representation of the gene silencing capabilities of rationally-selected 
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, 
20 human lamin A/C, QB (ubiquiaol-cytochrome c reductase core protein I), MEKl and 
MEK2, ATEl (argmyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is 
the percent expression of the control. Numbers 1 , 2, 3 and 4 represent individual 
rationally selected siRNA. "Pool" represents a mixture of the four individual siRNA. 

25 Figure 13 is the sequence of the top ten Bcl2 siRNAs as detennined by Formula VIII. 
Sequences are Hsted 5' to 3', 

Figure 14 is the knockdown by tiie top ten Bcl2 siRNAs at lOOnM concentrations. 
The Y-axis represents the amount of expression relative to the non-specific (ns) and 
30 transfection mixture control. 



Figure 15 represents a functional walk where siRNA beginning on every other base 
pair of a region of the luciferase gene are tested for the ability to silence the luciferase 



gene. The Y-axis represents the percent expression relative to a control. The X-axis 
represents the position of each individual siRNA. 

Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
pools of 2 and 3 siRNAs duplexes taken from the walk described in Figure 1 5. The 
Y-axis represents the percent expression relative to controL The X-axis represents the 
position of the first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so oil 

Figure 17 is a histogram demonstrating tiie inhibition of target gene expression by 
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression- " 
relative to a controL The X-axis represents the positoin of the first siRNA in each 
pooh 

Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a controL The X-axis represents the position of the first siRNA 
in each pooL 

Figure 19 shows that pools of siRNAs (dark gray bar) work as well (or better) than 
the best siRNA in the pool (light gray bar). The Y-axis represents the percent 
expression relative to a controL The-X axis represents the position of the first siRNA 
in each pool. 

Figure 20 shows that the combination of several semifiinctional siRNAs (dark gray) 
result in a significant improvement of gene expression inhibition over individual 
(semi-fianctional; light gray) siRNA. The Y-axis represents the percent expression 
relative to a controL 

Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of 
gene expression of Beta-Galactosidase, Renilla Ludferase and SEAP (alkaline 
phosphatase). Numbers on the X-axis indicate the position of the 5'-most nucleotide 
of the sense strand of the duplex. The Y-axis represents the percent expression of 



each gene relative to a control. Libraries contain siRNAs that begin at the following 
nucleotides: Seap: Lib 1: 206, 766, 812,923, Lib 2: 1117, 1280, 1300, 1487, Lib 3: 
206, 766, 812, 923, 1 1 17, 1280, 1300,1487, Lib 4: 206, 812, 1 1 17, 1300, Lib 5: 766, 
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2: 
5 1087,1783,2399,3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 
2029, 2399,2590,3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029,2399,2590,3257; 
Renilla: Lib 1: 174,300,432,568, Lib 2: 592, 633, 729,867, Lib 3: 174, 300, 432, 
568, 592, 633,729,867, Lib 4: 174,432,592,729, Ub5: 300,568,633,867, Lib 6: 
592,568. 
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Figure 22 showS the results of an EGFR and TfiaR internalization assay when single 
gene knockdowns are performed. The Y-axis represents percent intemaUzation 
relative to control. 

15 

Figure 23 shows the results of an EGFR and TfiiR internalization assay when 
multiple genes are knocked down (e.g, Rab5a, b, c). The Y-axis represents the 
percent intemalization relative to control. 

20 Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs 

directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into 
cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target 
levels foi:,each of the four genes. A comparison is made between cells transfected 
with individual siRNAs vs. a pool of siRNAs directed against all four genes. 

25 

Figure 25 shows the functionality of ten siRNAs at 0.3nM concentrations. 

Detailed Description 
Definitions 

30 Unless stated otherwise, the following terms and phrases have the meanings 

provided below: 



siRNA 

The tenii "siRNA'' refers to small inhibitory RNA duplexes that induce the 
RNA interference (RNAi) pathway. These molecules can vary in length (generally 
between 18-30 basepairS) and contain varying degrees of complementarity to their 
target mRNA in the antisense strand. Some, but not all, siRNA have unpaired 
overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. 
The term "siRNA" includes duplexes of two separate strands, as well as single strands 
that can form hairpin structures comprising a duplex region. 

SiRNA may be divided into five (5) groups (non-functional, semi-functional, 
functional, highly functional, and hyper-functional) based on the level or degree of 
sileacing that they induce in cultured cell lines. As used herein, these definitions are 
based on a set of conditions where the siRNA is transfected into said cell line at a 
concentration of lOOnM and the level of silencing is tested at a time of roughly 24 
hours after transfection, and not exceeding 72 hours after transfection. In this context, 
"non-functional siRNA" are defined as those siRNA that induce less than 50% 
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing. 
"Functional siRNA" are molecules that induce 80-95% gene silencing. "Highly- 
functional siRNA" are molecules that induce greater than 95% gene silencing. 
"Hyperfunctional siRNA" are a special class of molecules: For purposes of this 
document, hyperfunctional siRNA are defined as those molecules that: (1) induce 
greater than 95% silencing of a specific target when they are transfected at 
subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce 
functional (or better) levels of silencing for greater than 96 hours. These relative 
functionalities (though not intended to be absolutes) may be used to compare siRNAs 
to a particular target for applications such as functional genomics, target identification 
and therapeutics, 

miRNA 

The term "miRNA" refers to microRNA. 

Gene silencing: 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 



variety of pathways. Unless specified otherwise, as used herin, gene silencing refers 
to decreases in gene product expression that results from RNA interference (RNAi), a 
defined, though partially characterized pathway whereby small inhibitory RNA 
(siRNA) act in concert with host proteins (e,g, the RNA induced silencing complex, 
5 RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The 
level of gene silencing can be measured by a variety of means, including, but not 
limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA 
techniques, transcription-sensitive reporter constructs, expression profiling (e.g, DNA 
chips), and related technologies. Altematively, the level of silencing can be measured 
10 by assessing the level of the protein encoded by a specific gene. This can be 
accompHshed by performing a number of studies including Western Analysis, 
measuring the levels of expression of a reporter protein that has e.g. fluorescent 
properties (e.g. GFP) or enzymatic activity (e.g. alkaline phosphatases), or several 
other procedures. 
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Transfection 

The term "transfection" refers to a process by which agents are introduced into 
a cell. The Kst of agents that can be transfected is large and includes, but is not 
limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
20 genes and organized into an expression plasmid, proteins, protein fi-agments, and 

more. There are multiple mefliods for transfecting agents into a cell including, but not 
lunited to, electroporation, calcium phosphate-based transfections, DEAE-dextran- 
based transfectiens, lipid-based transfections, molecular conjugate-based transfections 
(e.g, polylysine-DNA conjugates), microinjection and others. 

25 

Target 

The terra "target" is used in a variety of different forms throughout this 
document and is defined by the context in which it is used. "Target mRNA" refers to 
a messenger RNA to which a given siRNA can be directed against. "Target sequence" 
30 and "target site" refer to a sequence within the mRNA to which the sense strand of an 
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 



Off-target silencing and Off-target interference 

The phrases "off-target silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target mRNA due to overlapping and/or 
partial homology with secondary mRNA messages. 

SMARTscore'TM 

The term "SMARTscore™" refers to a number determined by applying any of 
the Formulas I - Formula IX to a given siRNA sequence. The term "SMART- 
selected" or "rationally selected" or "rational selection" refers to siRNA that have 
been selected on the basis of their SMARTscores™. 

Complementary 

The term "complementary" refers to the ability of polynucleotides to form 
base pairs with one another. Base pairs are typically formed by hydrogen bonds 
between nucleotide units in antiparallel polynucleotide strands. Complementary 
polynucleotide strands can base pair in the Watson-Crick maimer (e.g., A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, when using RNA as opposed to DN A, uracil 
rather than thymine is the base that is considered to be complementary to adenosine. 
However, when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 

Perfect complementarity or 100% complementarity refers to the situation in 
which each nucleotide \jnit of one polynucleotide strand can hydr^ogen bond with a 
nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
refers to the situation in which some, but not all, nucleotide units of two strands can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 
10% complementarity. In the same example, if 18 base pairs on each strand can 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. "Substantial complementarity" refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 
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("Substantial similarity" refers to polynucleotide strands exhibiting 79% or greater 
similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thus, for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3' terminus such that the 
5 duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 
each strand are complementary, are substantially complementary since they are 96.3% 
complementary whoi excluding the di-dT overhangs. 

Deoxvnucleotide 

1 0 The term "deoxynucleotide" refers to a nucleotide or polynucleotide lacking a 

hydroxyl group (OH group) at the 2' and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the 2' and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at the 2' position of the sugar moiety, having instead a hydrogen bonded 

1 5 directly to the 2' carbon. 

Deoxvribonuclentide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
20 at its 2' and/or 3'position. 

Substantially Similar 

The phraser^'substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 

25 

Duplex Reg ion 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
30 duplex between polynucleotide strands that are complementary or substantially 

complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 19 base pairs. The remaining bases may, for example, exist as 5' and 3' 



overhangs. Further, within the duplex region, 100% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
5 rendering the duplex region substantially complementary. 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides include species that 
10 comprise purines, adenine, hypoxanthine, guanine, and their derivatives and 
analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs; 

Nucleotide analogs include nucleotides having modifications in the chemical 
1 5 structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 
pyrimidine modifications, 8-position purine modifications, modifications at cytosine 
exocyclic amines, and substitution of 5-bromo-uracil; and 2'-positi6n sugar 
modifications, including but not limited to, sugar-modified ribonucleotides in which 
the 2'-0H is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, 
20 NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 
include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2'-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphorothioates and-peptides. 

25 Modified bases refer to nucleotide bases such as, for example, adenine, 

guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
examples of types of modifications that can comprise nucleotides that are modified 
with respect to the base moieties include but are not limited to, alkylated, halogenated, 

30 thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2- 
propyl guanine, 2-aminoadenine, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 
5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2- 



amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1- 
methyladenosine, 2-inethyladenosine, 3-methylcytidine, 6-methyluridine, 2- 
methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5- 
methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza- 
5 adenosine, 6-azouridine, 6-azocytidine, 6>azothymidine, 5-methyI-2-thiouridine, other 
thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-mefhyladenosine, 5- 
methylcarbonybnethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2- 

1 0 one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 

1 5 nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, tiie sugar moieties 
may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4'- 
thioribose, and other sugars, heterocycles, or carbocycles. 

20 The temi nucleotide is also meant to include what are known in the art as 

universal bases. By way of example, universal bases include but are not lunited to 3- 
nitropyrrole, 5-nitroindoIe, or nebularine. The term "nucleotide" is also meant to 
include the N3' to P5%phQsphfM^idate, resulting fi^om the substitution of a ribosyl 3' 
oxygen with an amine group. 

25 

Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 

30 Polynucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties 
(Le., wherein altemate nucleotide units have an -OH, then and -H, then an -OH, then 



an -H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide units at any position are included. 

Polyribonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide." 

Ribonucleotide and ribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide unit comprises an hydroxyl group attached to 
the 2' position of a ribosyl moiety that has a nitrogenous base attached in N- 
glycosidic linkage at the T position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 

Detailed Description of the Invention 

The present invention is directed to improving the efficiency of gene silendng 
by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may be achieved. 

The present invention will now be described in connection with preferred 
embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All altOTiatives, modifications and equivalents that may 
become apparent to those of ordinary skill upon reading this disclosure are included 
within the spirit and scope of the present invention. 

Furthermore, this disclosure is not a primer on RNA interference. Basic 
concepts known to persons skilled in the art have not been set forth in detail. 



Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
5 method may be used individually, or in conjunction with the first embodiment, Le„ 
with one or more other siRNAs, each of which may or may not be selected by this 
criteria in order to maximize their efficiency. 

The degree to which it is possible to select an siRNA for a given mRNA that 
1 0 maximizes tiiese criteria will depend on the sequence of the mRNA itself However, 
the selection criteria will be independent of the target sequence. According to flds 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational design is, in simplest 
terms, the application of a proven set of criteria that enhance the probability of 
1 5 identifying a functional or hyperfimctional siRNA. In one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 



1. 



A low GC content, preferably between about 30 -52%. 
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2. 



At least 2, preferably at least 3 A or U bases at positions 15- 19 of the * 
siRNA on the sense strand. 



3. 



An A base at position 19 of the sense strand. 
An A base at position^of the sense strand. 
A U base at position 10 of the sense strand. 
An A base at position 14 of the sense strand. 



4. 



5. 
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6. 



7. 



A base other than C at position 19 of the sense strand. 



8. A base other than G at position 1 3 of the sense strand. 

9. A Tm, which refers to the character of the internal repeat that results in 
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inter- or intramolecular structures for one strand of the duplex, that is 
preferably not stable at greater than 50^*0, more preferably not stable at 
greater than 3TC, even more preferably not stable at greater than 30^C 



and most preferably not stable at greater than 20*^0. 

10. A base other than U at position 5 of the sense strand. 

11. A base other than A at position 1 1 of the sense strand. 



Criteria 5, 6, 10 and 1 1 are mitior criteria, but are nonetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1-4 and 7-9, and most preferably all of the 
criteria 

With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex imwinding has been shown to be crucial for siRNA 
functionality in vivo. 

With respect to criterion 9, the internal structure is measured in terms of the 
melting temperature of the single strand of siRNA, which is the temperature at which 
50% of the molecules will become denatured. With respect to criteria 2-8 and 10 - 
1 1, the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred 
embodiment, at least criteria 1, 8 and 9 are satisfied. 

It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect to^fe^&end of the sense strand. Reference is 
made to the sense strand, because most databases contain information that describes 
the information of the mRNA. Because according to the present invention a chain can 
be from 1 8 to 30 bases in length, and the aforementioned criteria assumes a chain 19 
base pairs in length, it is important to keep the aforementioned criteria applicable to 
the correct bases. 

When there are only 18 bases, the base pair that is not present is the base pair 
that is located at the 3' of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5' end of the sense chain and occupy 
positions ~1 to '1 1 . Accordingly, with respect to SEQ. ID NO. 0001 . 
NNANANNNNUCNAANNNNA and SEQ. ID NO. 0028. 



GUCNNANANNNNUCNAANNNNA, both would have A at position 3, A at 
position 5, U at position 1 0, C at position 1 1, A and position 13, A and position 14 
and A at position 1 9. Howevesr, SEQ. ID NO. 0028 would also have C at position -1 , 
U at position -2 and G at position -3. 

5 

For a 19 base pair siRNA, an optimal sequence of one of the strands may be 
represented below, where N is any base. A, C, G, or U: 

SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA 
10 SEQ. ID NO. 0002. NNANANNNNUCNAANNNNA 

SEQ. ID NO. 0003." NNANANNNNUUNAANNNNA 

SEQ. ID NO. 0004. NNANAMnINNUCNCANNNNA 

SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA 

SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA 
15 SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA 

SEQ. ID NO. 0008.. NNANANNNNUCNUANNNNA 

SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA 

SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 001 1 . NNANCNNNNUCNAANNNNA 
20 -SEQ. ID NO. 0012. NNANCNNNNUUNAANNNNA 

SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0014. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0015. NNANCNNNNUUJsIfiAiNMNNA 

SEQ. ID NO. 0016. NANCNNNNUCNUANNNNA 
25 SEQ. ID NO. 0017. NNANCNNNNUCNUANNNNA 

SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA 

SEQ. ID NO. 0019. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 0020. NNANCNNNNUCNAANNNNA 

SEQ. DD NO. 0021 . NNANCNNNNUUNAANNNNA 
30 SEQ. ID NO. 0022. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0023. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0024. NNANCNNNNUUNCANNNNA 

SEQ. ID NO. 0025. NNANCNNNNUCNUANNNNA 

SEQ. ID NO. 0026. NNANCNNNNUCNUANNNNA 



SEQ. ID NO. 0027. NNANGNNNNNUNUANNNNA 



In one embodiment, the sequence used as an siRNA is selected by choosing 
the SiRNA that score highest according to one of the following seven algorithms that 
are represented by Formulas I - VII: 



Formula I 



Relative functionality of siRNA= -(GC/3) +(AUi5-.9) -(Tm2ooc)*3 -(G,3)*3 -(€,9) 
+(Ai9)*2+(A3) +(Uio)+(A,4)-<U5)-(Aii) 

Formula II 

Relative functionality of siRNA= ^OaS) -(AU,5.,9)*3 -(G,3)*3 -(€,9) +(A,9)*2 
■KA3) 

Formula HI 

Relative functionality of siRNA= -{GC/3) +(AUi5.i9) -(Tm2ox)*3 
Formula IV 

Relative ftinctionality of siRNA= 

-GCVZ+C AU,5.,9)/2-(Tm2ooc)*2 -(G,3)*3 -(C19) +(Ai9)*2 +(A3) +(U,o)+(A,4) -(U5) - 
(An) 

Formula V ..^„«^-a->r^ 

Relative functionality of siRNA=-(Gi3)*3 -(€,9) +(Ai9)*2 +(A3) + (UioHAh) -(Us) 
-(A„) 

Formula VI 

Relative functionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) 
Formula VII 

Relative functionality of siRNA=-(GC/2) +(AUi5-i9)/2 -( Tmjooc)*! -(G,3)*3 ^€,9) 
+(Ai9)*3 +(A3)*3 +(Uio)/2+(A,4)/2 -(Us)/! -(A„)/2 



In Formulas I - VII: 



wherein A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 
is 0, 

AU15.19 = 0-5 depending on the number of A or U bases on the sense strand 

at 

positions 15-19; 

Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 

0; 

C19 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

0; 

GC= the number of G and C bases in the entire sense strand; 
Tm M'c^l if the Tm is greater than 20°C; 

A3= 1 if A is the base at position' 3 on the sense strand, otherwise its value is 0; 
U,o= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

A 14 = 1 if A is the base at position 14 on the sense strand, otherwise its value 

is 0; 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its value is 

0; and 

A, , = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 
0. . ■ 



Formulas I -VII provide relative infonnation regarding functionality. When 
the values for two sequences are compared for^-giveirfonnula, the relative 
functionality is ascertained; a higher positive number indicates a greater functionality. 
For example, in many applications a value of 5 or greater is benelBcial. 

Additionally, in many applications, more than one of these fomulas would 
provide useful information as to the relative functionality of potential siRNA 
sequences. However, it is beneficial to have more than one type of formula, because 
not every formula will be able to help to differentiate among potential siRNA 
sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and appUcation of formulas that lack GC 
elements (e.g., formulas V and VI) might provide greater insights into duplex 
functionality. Similarly, formula II might by used in situations where hairpin 



structures are not observed in duplexes, and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 
differentiation among sequences. In some instances, application of a given algorithim 
may identify an unususally large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Alternatively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequences {i.e. low 
SMARTscores™). In this instance, it may be appropriate to re-analyze that sequences 
with a second algorithm that is, for instance, less stringent In still other instances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
results (/.e. one formula generates a set of srRNA with high SMARTscores™ while 
the other formula identifies a set of siRNA with low SMARTscores™). In these 
instances, it may be necessary to determine which weighted factor(s) (e.g. GC 
content) are contributing to the discrepancy and assessing the sequ«ice to decide 
whether these factors should or should not be included. Alternatively, the sequence 
could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally 
designed siRNA. 

The above-referenced criteria are particularly advantageous when used in 
combination with pooling techniques as depicted in Table I: 



Table I 



Criteria 


Functional Probability 




Ohgos 


Pools 




>95% 


>80% 


<70% 


>95% 


>80% 


<70% 


Current 


33.0 


50.0 


23.0 


79.5 


97.3 


0.3 


New 


50.0 


88.5 


8.0 


93.8 


99.98 


0.005 


(GC) 


28.0 


58.9 


36.0 


72.8 


97.1 


1.6 



The term "current" refers to Tuschl's conventional siRNA parameters (Elbashir, S.M. 
et al. (2002) "Analysis of gene fimction in somatic mammalian cells using small 
interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
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described in Formulas I-VIL "GC" refers to criteria that select siRNA solely on the 
basis of GC content. 



As Table I indicates, when more functional siRNA duplexes are chosen, 
5 siRNAs that produce <70% silencing drops from 23% to 8% and flie number of 
siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. Further, of 
the siRNA duplexes with >80% silencing, a larger portion of these siRNAs actually 
silence >95% of the target expression (the new criteria increases the portion from 
33% to 50%). Using this new criteria in pooled siRNAs, shows that, with pooling, the 
1 0 amount of silencing >95% increases from 79.5% to 93 .8% and essentially ehminates 
any siRNA pool from silencing less than 70%. 

Table II similarly shows the particularly beneficial results of pooling in 
combination with the aforementioned criteria. However, Table II, which takes into 
1 5 account each of the aforementioned variables, demonstrates even a greater degree of 
improvement in ftinctionality. 



Table II 





Functional Probability 


Oligos 


Pools 


Functional 


Average 


Non- 
functional 


Functional 


Average 


Non- 
frmctional 


Random 


20 


40 


50 


67 


97 


3 


Criteria 1 


52 


99 


0.1 




-93 


0.0040 


Qiteria 4 


89 


99 


0.1 


99 


99 


0.0000 



The tenns "functional," "Average," and "Non-functionaF' refer to siRNA that exhibit 
20 >80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 
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The above-described algorithms may be used with or without a computer 
program that allows for the inputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 



from a local terminal or personal computer, over an internal network or over the 
Internet. 

In addition to the formulas above, more detailed algorithms may be used for 
5 selecting siRNA. Preferably, at least one RNA duplex of between 1 8 and 30 base 
pairs is selected such that it is optimized according a foraiula selected from: 

Formula VIII: (-14)*Gi3-13*Arl2*U7-ll*U2-10*An-10*U4-10*C3-10*Q^ 
10 9*A,o-9*U9-9*C,8-8*Gio-7*Ur7*Ui6-7*C,7-7*C,9 

+7*Ui7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +1 0*ArHl 0*Ui8+ll *A,9+ 
11*C9+15*G,+ 18*A3+19*U,o-Tm-^3* (GQotaO~6*(GC,5-i9)- 
30*X;and 

15 FormulalX: (14J)*A3+(14.9)*A6+(17.6)*A,3+(24.7)*Ai9+(14.2)*Uio+(10^^^ 
C9+(23.9)'Kji+(163)=«<}2+(-12J)*Au+(-193)*U,+^^ 
(-1 l)*U3+(-l 5.2) *Ui5-K-l 1 3) *U,6 + (-1 1 .8)*C3+(-l 7.4) *C6+(- 
10.5)*C7+ (-13.7)*G,3+(-25.9)*Gi^Tm-3* (GQotai) - 6*(GCi5-i9)- 
30*X 

20 ;. 
wherein 

Ai = 1 if A is the base at position 1 of the sense sti-and, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
25 A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

A5 = I if A is the base at position 5 of the sense strand, otherwise its value is 0; 

Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7= 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
30 All = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 

Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 



C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if G is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
Q = 1 if C is the base at position 6 of the sense strand, otiierwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
present or the sense strand is only 18 base pairs in length, its value is 0; 

G| = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio = 1 if G is the base at position 1 0 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is tlie base at position 13 on the sense strand, otherwise its value is 0; 
Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base 
present or the saise strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, othenadse ite value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
U|o = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 1 7 on the sense strand, otherwise its value is 0; 
Uig = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

GC15-19 = the numbo- of G and C bases within positions 15-19 of the sense 
strand, or within positions 15 -18 if the sense strand is only 1 8 base pairs in 
length; 



GCtotai = the number of G and C bases in the sense strand; 

Tm = 1 00 if the siRNA oligo has the internal repeat longer then 4 base pairs, 

otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a 
row. 

The above formulas VIII and DC, as well as formulas I - VII, provide methods 
for selecting siRNA in order to increase the efficiency of gene silencing. A subset of 
variables of any of the formulas may be used, though when fewer variables are used, 
the optinaization hierarchy becomes less reliable. 

With respect to the variables of the above-referenced formulas, a single letter 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 
condition is that either the particular base is present at that particular position 
(wherein the value is "1") or the base is not present (wherein the value is "0")- 
Because position 19 is optional, Le, there might be only 18 base pairs, when there are 
only 18 base pairs, any base with a subscript of 19 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 
*, which indicates that the value of the variable is to be multiplied or weighed by that 
number. -c" 

Tlie numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
IX (or after the variables in Formula I - VII) were determined by eomparing the 
difference in the fi'equency of individual bases at different positions in fimctional 
siRNA and total siRNA. Specifically, the frequency in which a given base was 
observed at a particular position in fimctional groups was compared with the 
frequency that that same base was observed in the total, randomly selected siRNA set. 
If the absolute value of the difference between the fimctional and total values was 
found to be greater than 6%, that parameter was included in the equation. Thus for 
instance, if the frequency of finding a "G" at position 13 (G13) is found to be 6% in a 
given fimctional group, and the frequency of Gn in the total population of siRNAs is 
20%, the difference between the two values is 6%-20% = -14%. As the absolute value 
is greater than six (6), this factor (-14) is included in the equation. Thus in Formula 
VIII, in cases where the siRNA under study has a G in position 13, the accrued value 
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is (-14) * (1) = -14. In contrast, when a base other than G is found at position 13, the 
accrued value is (-14) * (0) = 0. 

When developing a means to optimize siRNAs, the inventors observed that a 
5 bias toward low internal thermodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring miRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 

1 0 With respect to the parameto: GC15.19, a value of 0 - 5 will be ascribed 

depending on the number of G or C bases at positions 15 to 19. If there are only 18 
base pairs, the value is between 0 and 4. 

With respect to the criterion GCtotai content, a number from 0-30 will be 
1 5 ascribed, which correlates to the total number of G and C nucleotides on the sense 
strand, excluding overhangs. Without wishing to be bound by any one theory, it is 
postulated that the significance of the GC content (as well as AU content at positions 
1 5-1 9, which is a parameter for formulas III - VH) relates to the easement of the 
unwinding of a double-stranded siRNA duplex. Duplex unwmding is believed to be 
20 crucial for siRNA functionality in vivo and overall low intemal stability, especially 
low intemal stability of the first unwound base pair is believed to be important to 
maintain sufficient processivity of RISC complex-induced duplex unwinding. If the 
duplex has 19 base pairs, those at positions 15-19 on the sense strand<.willsijn.wind first 
if the molecule exhibits a sufficiently low intemal stability at that position. As 
25 persons skilled in the art are aware, RISC is a complex of approximately twelve 
proteins; Dicer is one, but not the only, helicase within this complex. Accordingly, 
althougli the GC parameters are beUeved to relate to activity with Dicer, they are also 
important for activity with other RISC proteins. 

30 The value of the parameter Tm is 0 when there are no intemal repeats longer 

than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 
1 . Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or 
more) palindrome exists within the stracture, the value will be one (1). Alternatively 
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if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the 
value will be zero (0). 

The variable "X" refers to the number of times that the same nucleotide occurs 
5 contiguously in a stretch of four or more units. If there are, for example, four 
contiguous As in one part of the sequence and elsewhere in the sequence four 
contiguous Cs, X =2. Further, if there are two separate contiguous stretches of four of 
the same nucleotides or eight or more of the same nucleotides in a row, then X=2. 
However, X does not increase for five, six or seven contiguous nucleotides. 

10 

Again, when applying Formula VIII or Formula IX to a given mRNA, (the 
"target RNA'* or "target molecule''), one may use a computer program to evaluate the 
criteria for every sequence of 18 - 30 base pairs or only sequences of a fixed length, 
e,g., 19 base pairs. Preferably the computer program is designed such that it provides 

15 a report ranking of all of the potential siRNAs between 1 8 and 30 base pairs, ranked 
according to which sequences generate the highest value. A higher value refers to a 
more efficient siRNA for a particular target gene. The computer program that may be 
used, may be developed in any computer language that is known to be usefiil for 
scoring nucleotide sequences, or it may be developed with the assistance of 

20 commercially available product such as Microsoft's product .net. Additionally, rather 
than run every sequence through one and/or another fomiula, one may compare a 
subset of the sequences, which may be desirable if for example only a subset are 
available. For instance, it may be desirable to first perform a BLAST (BasiGrJtocal* . 
Alignment Search Tool) search and to identify sequences that have no homology to 

25 other targets. Alternatively, it may be desirable to scan flie sequence and to identify 
regions of moderate GC context, then perform relevant calculations using one of tiie 
above-described formulas on these regions. These calculations can be done manually 
or with the aid of a computer. 

30 As with Formulas I - VII, either Formula VIII or Formula DC may be used for 

a given mRNA target sequence. However, it is possible that according to one or the 
other formula more than one siRNA will have the same value. Accordingly, it is 
beneficial to have a second formula by which to differentiate sequences. Formula DC 
was derived in a similar fashion as Formula VIII, yet used a larger data set and thus 



yields sequences with higher statistical correlations to highly functional duplexes. The 
sequence that has die highest value ascribed to it may be referred to as a "first 
optimized duplex." The sequence that has the second highest value ascribed to it may 
be referred to as a "second optimized duplex." Similarly, the sequences that have the 
5 third and fourth highest values asaribed to them may be referred to as a third 

optimized duplex and a fourth optimized duplex, respectively. When more than one 
sequence has the same value, each of them may, for example, be referred to as first 
optimized duplex sequences or co-first optimized duplexes, 

1 0 SiRN A sequences identified using Formula VIII are contained within flie 

enclosed compact disks. The data included on the enclosed compact disks is 
described more fully below. The sequences identified by Formula VHI that are 
disclosed in the compacts disks may be used in gene silencing applications. 

1 5 It should be noted that for Formulas VIH and IX aU of the aforementioned 

criteria are identified as positions on the sense strand when oriented in the 5' to 3' 
direction as they are identified in connection with Formulas I - VH unless othawise 
specified. 



20 Formulas I - DC, may be used.ta select or to evaluate one, or more than one, 

siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that 
have been selected according to at least one of these formulas are used to silence a 
gene, more preferably at least three and most preferably at least four. The siRNAs^...-* 
may be used individually or together in a pool or kit. Further, they may be applied to 

25 a cell simultaneously or separately. Preferably, the at least two siRNAs are applied 
simultaneously. Pools are particularly beneficial for many research applications. 
However, for therapeutics, it may be more desirable to employ a single 
hyperflinctional siRNA as described elsewhere in fliis application. 



When planning to conduct gene silencing, and it is necessary to choose 
betweoi two or more siRNAs, one should do so by comparing the relative values 
when the siRNA are subjected to one of the formulas above. In general a higher 
scored siRNA should be used. 



Useful applications include, but are not limited to, target validation, gene 
functional analysis, research and drug discovery, gene therapy and therapeutics. 
Methods for using siRNA in these applications are well known to persons of skill in 
the art. 

5 

Because the ability of siRNA to function is dependent on the sequence of the 
RNA and not the species into which it is introduced, the present invention is 
applicable across a broad range of species, including but not limited to all mammalian 
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and 
1 0 gorillas, as well as other species and organisms such as bacteria, viruses, insects, 
plants and C elegans. 

The present invention is also applicable for use for silencing a broad range of 
genes, including but not limited to the roughly 45,000 genes of a human genome, and 
15 has particular relevance in cases where those genes are associated with diseases such 
as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the 
aforementioned organisms. 

The siRNA selected according to the aforementioned critma or one of the 
20 aforementioned algorithms are also, for example, useful in the simultaneous screening 
and functional analysis of multiple genes and gene families using high throughput 
strategies, as well as in direct gene suppression or silencing. 



Development of the Algorithms 

25 To identify siRNA sequence features that promote functionality and to 

quantify the importance of certain currently accepted conventional factors — such as 
G/C content and target site accessibility^the inventors synthesized an siRNA panel 
consisting of 270 siRNAs targeting three genes. Human Cyclophilin, Firefly 
Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific 

30 regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs 
were directed against a 199 bp segment of each respective mRNA. For DBI, 90 
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The 
sequences to which the siRNAs were directed are provided below. 



It should be noted that in certain sequences, "t" is present. This is because 
many databases contain information in this maimer. However, the t denotes a uracil 
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, 
process a t in a sequence as a u. 

Human cvclopliilin: 193—390. M60857 
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
cttccccgatgagaacttcaaactgaagcactacgggcctggctggg 

Firefly luciferase: 1434—1631. U47298 ft)GL3. Promep a^ 
SEQ. ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg 
caagaaaaatcagagagatcctcataaaggccaagaagg 

DBI. NM 020548 ( 202-31 0^ reverv position^ 
SEQ. ID NO. 0031: 

acgggcaaggccaagtgggatgcctggaatgagctgaaagggacttccaaggaagatgccatgaaagcttacatcaaca 
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II) 

The set of duplexes was analyzed to identify correlations between siRNA 
functionality and other biophysical or thermodynamic properties. When the siRNA 
panel was analyzed in functional and non-functional subgroups, certain nucleotides 
were much more abundant at certain positions in functional or non-ftmctional groups. 
More specifically, the frequency of each nucleotide at each position in highly 
functional siRNA duplexes was compared with that of nonftmctional duplexes in 
order to assess the preference for or against any given nucleotide at every position. 
These analyses were used to deteraiine important criteria to be included in the siRNA 
algorithms (Formulas VIII and DC). 



The data set was also analyzed for distinguishing biophysical properties of 
siRNAs in the functional group, such as optimal percent of GC content, propensity for 
internal structures and regional thermodynamic stability. Of the presented oiteria, 
several are involved in duplex recognition, RISC activation/duplex unwinding, and 
5 target cleavage catalysis. 

The original data set that was the source of the statistically derived criteria is 
shown in Figure 2. Additionally, this figure shows that random selection yields 
siRNA duplexes with unpredictable and widely varying silencing potencies as 

10 measured in tissue culture using HEK293 cells. In the figure, diq>lexes are plotted 
such that each x-axis tick-mark represents an individual siRNA, with eadi subsequent 
siRNA differing in target position by two nucleotides for Human Cyclophilin and 
Firefly Luciferase, and by one nucleotide for Human DBI. Furthemiore, the y-axis 
denotes the level of target expression remaining after transfection of the duplex into 

1 5 cells and subsequent silencing of the target 

SiRNA identified and optimized in this document work equally well in a wide 
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the 
DBI gene in three cell lines derived fi-om different tissues. Each DBI siRNA displays 

20 very similar functionality in HEK293 (ATCC, CR3L-1 573, human embryonic kidney), 
HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81, 
prostate) cells as deterimined by the B-DNA assay. Thus, siRNA fijnctionality is 
• determined by the primary sequence of the siRNA and not by the intracellular 
environment. Additionally, it should be noted that although the present invention 

25 provides for a determination of the functionality of siRNA for a given target, the same 
siRNA may silence more than one gene. For example, the complementary sequence 
of the silencing siRNA may be present in more than one gene. Accordingly, in these 
circumstances, it may be desirable not to use the siRNA with highest SMARTscore™. 
hi such circumstances, it may be desirable to use the siRNA with the next highest 

30 SMARTscore™. 

To determine the relevance of G/C content in siRNA funcfion, the G/C content 
of each duplex in the panel was calculated and the functional classes of siRNAs 
(<F50, > F50, > F80, > F95 where F refers to the percent gene silencing) were sorted 
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accordingly- The majority of the highly-fiinctional siRNAs (>F95) fell within ttie 
G/C content range of 36% — 52% (Figure 3B). Twice as many non-functional (< 
F50) duplexes fell within the high G/C content groups (>57% GC content) compared 
to die 36% — 52% gi'oup. The group with extremely low GC content (26% or less) 

5 contained a higher proportion of non-functional siRNAs and no highly-functional 
siRNAs. The G/C content range of 30% — 52% was thCTefore selected as Criterion I 
for siRNA functionality, consistent with the observation that a G/C range 30% — ^70% 
promotes efficient RNAi targeting. Apphcation of this criterion alone provided only a 
marginal increase in the probability of selecting functional siRNAs from the panel: 

10 selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The 
siRNA panel presented here permitted a more systematic analysis and quantification 
of the importance of this criterion than that used previously. 



A relative measure of local internal stability is the A/U base pair (bp) content; 

1 5 therefore, the frequency of A/U bp was determined for each of the five terminal 
positions of the duplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel. 
Duplexes were then categorized by the number of A/U bp in positions 1 — ^5 and 15 — 
19 of the sense strand. The thermodynamic flexibility of the duplex 5 '-end (positions 
1 — 5; S) did not appear to correlate appreciably with silencing potency, while that of 

20 the 3 '-end (positions 15 — 19; S) correlated with efficient silencing. No duplexes 
lacking A/U bp in positions 1 5 — 1 9 were functional. The presence of one A/U bp in 
this region conferred some degree of functionality, but the presence of three or more 
A/Us. was preferable and therefore defined as Criterion 11. When applied to the test 
panel, only a marginal increase in the probability of functional siRNA selection was 

25 achieved: a 1 .8% and 2.3% increase for F50 and F95 duplexes, respectively (Table 
IV). 



The complementary strands of siRNAs that contain internal rq)eats or 
palindromes may form internal fold-back structures. These hairpin-like structures 
30 exist in equilibrium with the duplexed form effectively reducing the concentration of 
functional duplexes. The propensity to form internal hairpins and their relative 
stability can be estimated by predicted melting temperatures. High Tm reflects a 
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to 
form hairpins. When the functional classes of siRNAs were sorted by T„, (Figure 3c), 



the following trends were identified: duplexes lacking stable internal repeats were the 
most potent silencers (no F95 duplex with predicted hairpin structure T,„ > 60 ^C). In 
contrast, about 60% of the duplexes in the gi oups having internal hairpins with 
calculated T« values less than 20 ""C were F80. Thus, the stability of internal repeats 
5 is inversely proportional to the silencing effect and defines Criterion III (predicted 
hairpin structure < 20 ^C). 

Sequence-based determinants of siRNA functionality 

When the siRNA panel was sorted into functional and non-functional groups, 

10 the fi*equency of a specific nucleotide at each position in a functional siRNA duplex 
was compared with that of a nonfunctional duplex in order to assess the preference for 
or against a certain nucleotide. Figure 4 shows the results of these queries and the 
subsequent resorting of the data set (firom Figure 2). The data is separated into two 
sets: those duplexes that meet the criteria, a specific nucleotide in a certain position - 

1 5 grouped on the left (Selected) and those that do not - grouped on the right 

(Eliminated). The duplexes are further sorted firom most functional to least functional 
with the y-axis of Figure 4a-e representing the % expression i.e. the amount of 
silencing that is elicited by the duplex (Note: each position on the X-axis represents a 
different duplex). Statistical analysis revealed correlations between silencing and 

20 several sequence-related properties of siRNAs. Figure-4-and Table IV show 

quantitative analysis for the following five sequence-related properties of siRNA: (A) 
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) 
a U at position 10 of the sense strand; (D) a base other than G at position 13 of the 
SOTse strand; and (E) a base other than C at position 19 of the sense strand. 

25 

When the siRNAs in the panel were evaluated for the presence of an A at 
position 19 of the sense strand, the percentage of non-functional duplexes decreased 
from 20% to 1 1.8%, and the percentage of F95 duplexes increased from 21.7% to 
29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV. 

30 

Another sequence-related property correlated with silencing was the presence 
of an A in position 3 of the sense strand (Figure 4b). Of the siRNAs wi& A3, 34.4% 
were F95, compared with 21 .7% randomly selected siRNAs. The presence of a U 
base in position 10 of the sense strand exhibited an even greater impact (Figure 4c). 



Of the duplexes in this group, 41 .7% were F95. These properties became criteria V 
and VI, respectively. 



Two negative sequence-related criteria that were identified also appear on 
5 Figure 4. The absence of a G at position 1 3 of the sense strand, conferred a marginal 
increase in selecting functional duplexes (Figure 4d). Similarly, lack of a C at 
position 19 of the sense strand also correlated with functionality (Figure 4e). Thus, 
among functional duplexes, position 19 was most likely occupied by A, and rarely 
occupied by C, These rules were defined as criteria VII and VIII, respectively. 

10 

Application of each criterion individually provided marginal but statistically 
significant increases in the probability of selecting a potent siRNA. Although the 
results were informative, the inventors sought to maximize potency and therefore 
consider multiple criteria or parameters. Optimization is particularly important when 
1 5 developing therapeutics. Interestingly, the probability of selecting a fimctional siRNA 
based on each thermodynamic criteria was 2%— 4% higher than random, but 4%— 
8% higher for the sequence-related determinates. Presumably, these sequence-related 
increases reflect the complexity of the RNAi mechanism and the multitude of protein- 
RNA interactions that are involved in RNAi-mediated silencing. 



Table rV 
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20 









Improvement 


Criterion 


% Functional 


over Random 


1. 30%-^2% G/C content 


<F50 


16.4% 


-3.6% 




>F50 


83.6% 


\J.\j /o 




>F80 


60.4% 


4.3% 




>F95 


23.9% 


2.2% 


!1. At least 3 A/U bases at positions 


<F50 


18.2% 


-1,8% 


i 5—1 9 of the sense strand 


>F50 


81.8% 


1 .0 /o 




>F80 


59.7% 


3.6% 




>F95 


24.0% 


2.3% 


111. Absence of internal repeats, 


<F50 


16.7% 


-3.3% 


as measured bv T« of 


>F50 


83.3% 


o oo/ 


secondary stmcture < 20*'C 


>F80 


61.1% 


\J.\J /o 




>F95 


24.6% 


2.9% 


IV. An A base at position 19 


<F50 


11.8% 


-8.2% 


of the sense strand 


>F50 


88.2% 


ft oo/. 




>F80 


75.0% 


Ifl Q% 




>F95 


29.4% 


7 7% 


V. An A base at position 3 


<.F50 


17.2% 


-2.8% 


nf f hp ^priQf^ strand 


>F50 


82.8% 


Z.o /o 




>F80 


62.5% 


D.*f /O 




>F95 


34.4% 


1 ? 7% 

\ c.. l /O 


VL A U base at position 10 


<F50 


13.9% 


-6.1% 




> F50 


86.1% 


C '1 o/ 
D.1% 




>F80 


69.4% 


1 o. o /o 




>F95 


41,7% 


9n% 

iC.V/ /o 


VII. A base other than Cat 


<F50 


18.8% 


-1.2% 


position 19 of the sense strand 


>F50 


81.2% 


1.2% 




>F80 


59.7% 


3.6% 




>F95 


24.2% 


2.5% 


VIII. A base other than G at 


<F50 


15.2% 


-4.8% 


position 1 3 of the sense strand 


>F50 


84.8% 


4.8% 




>F80 


61.4% 


5.3% 




>F95 


26.5% 


4.8% 



25 

The siRNA selection algorithm 

In an effort to improve selection further, all identified ariteria, including but 
not limited to those listed in Table IV were combined into the algorithms embodied in 
Formula VIII and Formula DC. Each siRNA was then assigned a score (referred to as 
30 a SMARTscore™) according to the values derived firom the formulas. Duplexes that 
scored higher than 0 or 20, for Formulas Vm and DC, respectively, effectively 
selected a set of functional siRNAs and excluded all non-functional siRNAs. 
Conversely, all duplexes scoring lower than 0 and 20 according to formulas VIII and 
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DC, respectively, contained some ftinctional siRNAs but included aU non-functional 
siRNAs. A graphical representation of this selection is shown in Figure 5. 



are 



The methods for obtaining the seven aiteria embodied in Table IV 
illustrative of the results of flie process used to develop the information for Formulas 
VIII and DC. Thus similar techniques were used to establish the other variables and 
their multipliers. As described above, basic statistical methods were use to determine 
the relative values for these multipliers. 

To determine the value for "Improvement over Random" the difference in the 
frequency of a given attribute (eg. GC content, base preference) at a particular 
position is determined between individual fimctional groups (e.g. <F50) and the total 
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for 
instance, in Criterion I (30%-52% GC content) members of the <F50 group were 
observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the 
total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for 
this particular attribute, there is a small negative correlation between 30%-52% GC 
content and this functional group (i.e. 16.4%-20% = -3.6%). Similarly, for Criterion 
VI, (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this 
position 41.7% of the time. In contrast, the total group of 270 siRNAs had a "U" at 
this position 21 .7% of the time, thus the improvement over random is calculated to be 
20%(or41.7%-21.7%). 

Identifying The Averape Internal Stahilitv Profile of Sfa-ong .«riRNA 

In order to identify an internal stability profile fliat is characteristic of strong 
SiRNA, 270 different siRNAs derived from the cyclophilin B, tiie diazepam binding 
inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 
cells and tested for their ability to induce RNAi of the respective gene. Based on their 
performance in the in vivo assay, the sequences were then subdivided into three 
groups, (i) >95% silencing; (ii) 80-95% silaicing; and (iii) less than 50% silencing. 
Sequences exhibiting 51-84% silencing were eliminated from fiirther consideration to 
reduce the difficulties in identifying relevant thermodynamic pattems. 
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Following the division of siRNA into three groups, a statistical analysis was 
performed on each member of each group to determine the average internal stability 
profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis 
Software and other related statistical packages (e.g. Excel) were exploited to 
5 determine the intemal stability of pentamers using the nearest neighbor method 

described by Freier et aL, (1986) Improved fi^ee-energy parameters for predictions of 
RNA duplex stability, Proc Natl Acad. Sci. U. S. A. 83(24): 9373-7. Values for each 
group at each position were then averaged, and the resulting data were graphed on a 
linear coordinate system with the Y-axis expressing the AG (firee energy) values in 
10 kcal/mole and the X-axis identifying the position of the base relative to the 5' end. 

The results of the analysis identified multiple key regions in siRNA molecules 
that were critical for successfiil gene silencing. At the 3 '-most end of the sense strand 
(5'antisense), highly fimctional siRNA (>95% gene silencing, see Figure 6a, >F95) 

1 5 have a low intemal stability (AISP of position 19 = -- -7.6kcal/mol). In contrast low- 
efficiency siRNA {i.e. those exhibitmg less than 50% silencing, <F50) display a 
distinctly diiferent profile, having high AG values (- -8.4kcal/mol) for the same 
position. Moving in a 5' (sense strand) direction, the intemal stability of highly 
efficient siRNA rises (position 12 = - -8.3kcal/mole) and then drops again (position 7 

20 -7.7kcal/mol) before leveling off at a value of approximately -S.lkeal/mol for the 

5' terminus. SiRNA with poor silendng capabilities show a distinctly different 
profile. While the AISP value at position 12 is nearly identical with that of strong 
siRNAs, the values at positions 7 and 8 rise considerably, peaking at a high of - -9.0 
kcal/mol. In addition, at the 5' end of the molecule the AISP profile of strong and 

25 weak siRNA differ dramatically. Unlike the relatively strong values exhibited by 

siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have 
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively). 

Ov«-all the profiles of both strong and weak siRNAs form distinct sinusoidal 
30 shapes that are roughly 1 80° out-of-phase with each other. While these 

thermodynamic descriptions define the archetypal profile of a strong siRNA, it will 
likely be the case that neither the AG values given for key positions in the profile or 
the absolute position of the profile along the Y-axis (i.e. the AG -axis) are absolutes. 
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Profiles that are shifted upward or downward (z. e. having on an average, higher or 
lower values at every position) but retain the relative shape and position of the profile 
along the X-axis can be foreseen as being equally effective as the model profile 
described here. Moreover, it is likely that siRNA that have strong or even stronger 
5 gene-specific silencing effects might have exaggerated AG values (either higher or 
lower) at key positions. Thus, for instance, it is possible that the 5'-most position of 
the sense strand (position 19) could have AG values of 7.4 kcal/mol or lower and still 
be a strong siRNA if, for instance, a G-C G-T/U mismatch were substituted at 
position 19 and altered duplex stability. Similarly, position 12 and position 7 could 

10 have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without 

abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, 
a stabilizing chemical modification (e.g, a chemical modification of the 2' position of 
the sugar backbone) could be added that increases the average intemal stability at that 
position. Similarly, at position 7, mismatches similar to those described previously 

1 5 could be introduced that would lower the AG values at that position. 

Lastly, it is important to note that while functional and non-functional siRNA 
were originally defined as those molecules having specific silencing properties, both 
broader or more limiting parameters can be used to define these molecules. As used 

20 herein, unless otherwise specified, "non-functional siRNA" are defined as those 

siRNA that induce less than 50% (<50%) target silencing, "semi-functional siRNA" 
induce 50-79% target silencing, "functional siRNA" are molecules that induce 80- 
95% gene silencing, aird *1lighly-functional siRNA" are molecules that induce great 
than 95% gene silencing. These definitions are not intended to be rigid and can vary 

25 depending upon the design and needs of tlie application. For instance, it is possible 
that a researcher attempting to map a gene to a chromosome using a functional assay, 
may identify an siRNA that reduces gene activity by only 30%. While this level of 
gene silencing may be "non-functional" for e,g, therapeutic needs, it is sufficient for 
gene mapping purposes and is, under these uses and conditions, "functional." For 

30 these reasons, functional siRNA can be defined as those molecules having greater 
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 
lOOnM transfection conditions. Similarly, depending upon the needs of the study 
and/or application, non-fimctional and semi-functional siRNA can be defined as 



having different parameters. For instance, semi-functional siRNA can be defined as 
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 
lOOnM transfection conditions. Similarly, non-functional siRNA can be defined as 
being those molecules that silence gene expression by less than 70%, 60%, 50%, 
5 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in 
the "Definitions" section of this text should be applied. 

Functional attributes can be assigned to each of the key positions in the AISP 
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs may be 
1 0 necessary for determining which end of the molecule enters the RISC complex. In 

contrast, the high and low AISP values observed in the central regions of the molecule 
may be critical for siRNA-target mRNA interactions and product release, 
rrapectively. 

1 5 If the AISP values described above accurately define the thennodynamic 

parameters of strong siRNA, it would be expected that similar patterns would be 
observed in strong siRNA isolated firom nature. Natural siRNAs exist in a harsh, 
RNase-rich «ivironment and it can be hypothesized that only those siRNA that 
exhibit heightened affinity for RISC (Le. siRNA that exhibit an average internal 

20 stability profile similar to those observed in strong siRNA) would survive in an... 
intracellular environment. This hypothesis was tested using GFP-spedfic siRNA 
isolated from N. benthamiana. Llave et al. (2002) Endogenous and Silencing- 
Associated Small RNAs in Plaints. Plant Cell 14, 1605-1619, introduced long 
double-stranded GFP-encoding RNA into plants and subsequently re-isolated GFP- 

25 specific siRNA finom the tissues. The AISP of fifty-nine of these GFP-siRNA were 
determined, averaged, and subsequently plotted alongside the AISP profile obtained 
fix)m the cyclophilin B/DBI/ luciferase siRNA having >90% silencing properties 
(Figure 6b). Comparison of the two groups show that profiles are nearly identical. 
This finding validates the information provided by the internal stability profiles and 

30 demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/ 

luciferase siRNAs are not gene specific; and (2) AISP values can be used to search for 
strong siRNAs in a variety of species. 



Both chemical modifications and base-pair mismatches can be incorporated 
into siRNA to alter the duplex's AISP and functionality. For instance, introduction of 
mismatches at positions 1 or 2 of the sense strand destabilized the 5'end of the sense 
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly, 
5 addition of 2'-0-methyl groups to positions 1 and 2 of the sense strand can also alter 
the AISP and (as a result) increase both the functionality of the molecule and 
eliminate off-target effects that results firom sense strand homology with the unrelated 
targets (Figures 8a, 8b). 

10 Rationale for Criteria in a Biological Context 

The fate of siRNA in the RNAi pathway may be described in 5 major steps: 
(1) duplex recognition and pre-RISC complex foinaation; (2) ATP-dependent duplex 
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) 
mRNA cleavage, and (5) product release (Figure 1). Given the level of nucleic acid- 

1 5 protein interactions at each step, siRNA fiinctionality is likely influenced by specific 
biophysical and molecular properties that promote efficient interactions within the 
context of the multi-component complexes. Indeed, the systematic analysis of flie 
siRNA test set identified multiple factors that correlate well with functionality. When 
combined into a single algorithm, they proved to be very effective in selecting active 

20 siRNAs. 

The factors described here may also be predictive of key functional 
associations important for each stepJnJRNAi. For example, the potential formation of 
internal hairpin stiiictures correlated negatively with siRNA functionality. 

25 Complementary stands with stable internal repeats are more likely to exist as stable 
hairpins tiius decreasing the effective concenh-ation of the functional duplex fomi. 
This suggests that tiie duplex is the preferred conformation for initial pre-RISC 
association. Indeed, althou^ single complementary stiands can induce gene 
silencing, the effective concentration required is at least two orders of magnitude 

30 higher than that of the duplex foim. 

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex 
unwinding step and "activation" of the RISC. The siRNA fiinctionality was shown to 
correlate with overall low internal stability of the duplex and low internal stability of 



the 3' sense end (or differential internal stability of the 3' sense compare to the 5' 
sense strand), which may reflect strand selection and entry into the RISC. Overall 
duplex stability and low internal stabiUty at the 3' end of the sense strand were also 
correlated with siRNA functionality. Interestingly, siRNAs with very high and very 
5 low overall stability profiles correlate strongly with non-functional duplexes. One 
interpretation is that high internal stability prevents efficient unwinding while very 
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the 
RISC. 

1 0 Several criteria describe base preferences at specific positions of the sense 

strand and are even more intriguing when considering their potential mechanistic 
roles in target recognition^d mRNA cleavage. Base preferences for A at position 19 
of the sense strand but not C, are particularly interesting because they reflect the same 
base preferences observed for naturally occurring miRNA precuisors. That is, among 

1 5 the reported miRNA precursor sequences 75% contain a U at position 1 which 

corresponds to an A in position 19 of the sense strand of siRNAs, while G was under- 
represented in this same position for miRNA precursors. These observations support 
the hypothesis that both miRNA precursors and siRNA duplexes are processed by 
very similar if not identical protem machinery. The fimctional interpretation of the 

20 predominance of a U/A base pair is that it promotes flexibility at the 5'antisense ends 
of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding 
and selective strand entrance into an activated RISC. 

Among the criteria associated wifli base preferences that are likely to 
25 influence mRNA cleavage or possibly product release, the preference for U at 
position 10 of the sense strand exhibited the greatest impact, enhancing the 
probability of selecting an F80 sequence by 1 3.3%. Activated RISC preferentially 
cleaves target mRNA between nucleotides 10 and 1 1 relative to the 5' end of the 
complementary targeting strand. Therefore, it may be that U, the preferred base for 
30 most endoribonucleases, at this position supports more efficient cleavage. 

Alternatively, a U/A bp between the targeting siRNA strand and its cognate target 
mRNA may create an optimal conformation for the RISC-associated "slicing" 
activity. 
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Pooling 

According to another embodiment, the present invention provides a pool of at 
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one 
strand of each of the siRNAs, the sense strand comprises a sequence that is 
5 substantially similar to a sequence within a target mRNA. The opposite strand, the 
antisense strand, will preferably comprise a sequence that is substantially 
complementary to that of the target mRNA. More preferably, one strand of each 
siRNA will comprise a sequence that is identical to a sequence that is contained in the 
target mRNA. Most preferably, each siRNA will be 1 9 base pairs in length, and one 
10 strand of each of the siRNAs will be 100% complementary to a portion of the target 
mRNA. 

By increasing the number of siRNAs directed to a particular target using a 
pool or kit, one is able both to increase the likelihood that at least one siRNA with 

1 5 satisfactory functionality will be included, as well as to benefit firom additive or 

synergistic effects. Further, when two or more siRNAs directed against a single gene 
do not have satisfactory levels of functionality alone, if combined, they may 
satisfactorily promote degradation of the target messenger RNA and successfully 
inhibit translation. By including multiple siRNAs in the system, not only is the 

20 probability of silencing increased, but the economics of operation are also improved 
when compared to adding different siRNAs sequentially. This effect is contrary to the 
conventional wisdom that the concurrent use of multiple siRNA will negatively 
impact gene silencing (e.g. Holen, T.etal^2Q03) "Similar behavior of single strand 
and double strand siRNAs suggests they act through a conunon RNAi pathway." 

25 NAR 31: 2401-21407). 

In fact, when two siRNAs were pooled together, 54% of the pools of two 
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the 
percentage of functionality was achieved by randomly combining two siRNAs. 
30 Further, over 84% of pools containing two siRNAs induced more than 80% gene 
silencing. 

More preferably, the kit is comprised of at least three siRNAs, wherein one 
strand of each siRNA comprises a sequence that is substantially similar to a sequence 



of the target mRNA and the other strand comprises a sequence that is substantially 
complementary to the region of the target mRNA. As with the kit that comprises at 
least two siRNAs, more preferably one strand will comprise a sequence that is 
identical to a sequaice that is contained in the mRNA and another strand that is 1 00% 
5 complementary to a sequence that is contained in the mRNA. During experiments, 
when three siRNAs were combined together, 60% of the pools induced more than 
95% gene silencing and 92% of the pools induced more than 80% gene silencing. 

Further, even more preferably, the kit is comprised of at least four siRNAs, 
1 0 wherein one strand of each siRNA comprises a sequence that is substantially similar 
to a region of the sequence of the target mRNA, and the other strand comprises a 
sequence that is substantially complementary to the region of the target mRNA. As 
with the kit or pool that comprises at least two siRNAs, more preferably one strand of 
each of the siRNA duplexes will comprise a sequence that is identical to a sequence 
1 5 that is contained in the mRNA, and another strand that is 100% complementary to a 
sequence that is contained in the mRNA. 

Additionally, kits and pools with ait least five, at least six, and at least seven 
siRNAs may also be useful with the present invention. For example, pools of five 

20 siRNA induced 95% gene silencing with 77% probability and 80% silencing with 
98.8% probability. Thus, pooling of siRNAs together can result in the creation of a 
target-specific silencing reagent with almost a 99% probability of being fimctional. 
The fact that such high levels of success are ae^evable using such pools of siRNA, 
enables one to dispense with costly and time-consuming target-specific validation 

25 procedures. 

For this embodiment, as well as the other aforementioned embodiments, each 
of the siRNAs within a pool will preferably comprise between 18 and 30 base pairs, 
more preferably between 1 8 and 25 base pairs, and most preferably 1 9 base pairs. 
30 Within each siRNA, preferably at least 1 8 contiguous bases of the antisense strand 
will be 100% complementary to the target mRNA. More preferably, at least 19 
contiguous bases of the antisense strand will be 100% complementary to the target 
mRNA. Additionally, there may be overhangs on either the sense sti-and or the 
antisense strand, and these overhangs may be at either the 5' end or the 3 ' end of 



either of the strands, for example there may be one or more overhangs of 1-6 bases. 
When overhangs are present, they are not included in the calculation of the number of 
base pairs. The two nucleotide 3' overhangs mimic natural siRNAs and are 
commonly used but are not essential. Preferably, the overhangs should consist of two 
5 nucleotides, most often dTdT or UU at the 3' end of the sense and antisense strand 
that are not complementary to the target sequence. The siRNAs may be produced by 
any method that is now known or that comes to be known for synthesizing double 
stranded RNA that one skilled in the art would appreciate would be useful in the 
pres«it invention. Preferably, the siRNAs will be produced by Dhaimacon's 
1 0 proprietary ACE® technology. However, other methods for synthesizing siRNAs are 
well known to persons skilled in the art and include, but are not limited to, any 
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in 
vitro transcription of RNA oligonucleotides, the use of vectors for expression within 
cells, recombinant Dicer products and PGR products. 

15 

The siRNA duplexes within the aforonentioned pools of siRNAs may 
correspond to overlapping sequences within a particular mRNA, or non-overlapping 
sequences of the mRNA. However, preferably they correspond to non-overlapping 
sequences. Further, each siRNA may be selected randomly, or one or more of the 
20 siRNA May be selected according to the criteria discussed above for maximizing the 
effectiveness of siRNA. 

Included in the definition of siRNAs areaRMAs that contain substituted 
and/or labeled nucleotides that may, for example, be labeled by radioactivity, 

25 fluorescence or mass. The most common substitutions are at the 2' position of the 
ribose sugar, where moieties such as H (hydrogen) F, NH3, OCH3 and other O- alkyl, 
alkenyl, alkynyl, and orthoesters, may be substituted, or in the phosphorous backbone, 
where sulfur, amines or hydrocarbons may be substituted for the bridging of non- 
bridging atomsjn the phosphodiester bond. Examples of modified siRNAs are 

30 explained more fully in conmionly assigned U.S. Patent Application Ser. No. 
10/613,077, filed July 1, 2003, which is incorporated by reference herein. 



Additionally, as noted above, the cell type into which the siRNA is introduced 
may affect the ability of the siRNA to enter the cell; however, it does not appear to 



affect the ability of the siRNA to ftinction once it enters the cell. Methods for 
introducing double-stranded RNA into various cell types are well known to persons 
skilled in the art. 

5 As persons skilled in the art are aware, in certain species, the presence of 

proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically 
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in 
C elegans and other non-mammalian organisms. In fact, in organisms that contain 
these proteins, the siRNA may be inherited. Two other proteins that are well studied 
1 0 and known to be a part of the machinery are members of the Argonaute family and 
Dicer, as well as their homologues. There is also initial evidence that the RISC 
complex might be associated with the ribosome so the more efficiently translated 
mRNAs will be more susceptible to silencing than others. 

1 5 Another very important factor in the efficacy of siRNA is mRNA localization. 

In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any 
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs 
modified with intemucleotide linkages, may be able to cause silencing by acting in the 
nucleus. Examples of these types of modifications are desaibed in commonly 

20 assigned U.S. Patent Application Serial Nos. 10/431,027 and 10/613,077, each of 
which is incorporated by reference herein. 

As described above, even when one selects atJeast=two siRNAs at random, the 
effectiveness of the two may be greater than one would predict based on the 

25 effectiveness of two individual siRNAs. This additive or synergistic effect is 
particularly noticeable as one increases to at least three siRNAs, and even more 
noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of the non- 
functional and semi-functional siRNAs, particularly more than five siRNAs, can lead 
to a silencing mixture that is as effective if not more effective than any one particular 

30 functional siRNA. 



Within the kit of the present invention, preferably each siRNA will be present 
in a concentration of between 0.001 and 200 p.M, more preferably between 0.01 and 
200 nM, and most preferably between 0.1 and 10 nM. 



5 In addition to preferably comprising at least four or five siRNAs, the kit of the 

present invention will also preferably comprise a buffer to keep the siRNA duplex 
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA 
stable. For example, the buffer may be comprised of 1 00 mM KCl, 30 mM HEPES- 
pH 7.5, and 1 mM MgC^. Alternatively, kits might contain complementary strands 
10 that contain any one of a number of chemical modifications (e.g. a 2'-OACE) that 
protect the agents fi-om degradation by nucleases. In this instance, the user may (or 
may not) remove the modifying protective group (e,g, deprotect) before amiealing the 
two complementary strands together. 

1 5 By way of example, the kit may be organized such that pools of siRNA 

duplexes are provided on an array or microarray of wells or drops for a particular 
gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells 
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now 
known or that come to be known to persons skilled in the art. Within an array, 

20 preferably there%ill be controls such as ftinctional anti-lamin A/C, cyclophilin and 
two siRNA duplexes that are not specific to the gene of interest 

In order to ensure stability of the siRNA pool^*prior-4o=usage, they may be 
retained in lyophilized form at minus twenty degrees (— 20°C) until they are ready for 

25 use. Prior to usage, they should be resuspended; however, even once resuspended, for 
example, in the aforementioned buffer, they should be kept at minus twenty degrees, 
(— 20''C) until used. The aforementioned buffer, prior to use, may be stored at 
approximately A^'C or room temperature. Effective temperatures at which to conduct 
transfections are well known to persons skilled in the art and include for example, 

30 room temperature. 

The kit may be applied either in vivo or in vitro. Preferably, the siRNA of the 
pools or kits is applied to a cell tluough transfection, employing standard transfection 
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protocols. These methods are well known to persons skilled in the art and include the 
use of lipid-based carriers, electroporation, cationic carriers, and microinjection. 
Further, one could apply the present invention by synthesizing equivalent DNA 
sequences (either as two separate, complementary strands, or as hairpin molecules) 
5 instead of siRNA sequences and introducing them into cells through vectors. Once in 
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate 
the siRNA. Examples of vectors suitable for use witli the present application include 
but are not limited to the standard transient expression vectors, adenoviruses, 
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. 
1 0 Any vector that has an adequate siRNA expression and procession module may be 
used. Furthermore, certain chemical modifications to siRNAs, including but not 
limited to conjugations to other molecules, may be used to facilitate delivery. For 
certain applications it may be preferable to deliver molecules without transfection by 
simply fonnulating in a physiological acceptable solution. 

15 

This embodiment may be used in connection with any of the aforementioned 
embodiments. Accordingly, the sequences within any pool may be selected by 
rational design. 



Multigene Silencing 

In addition to developing kits that contain multiple siRNA directed against a 
single gene, another embodiment includes the use of multiple.^EblA targeting 
multiple genes. Multiple genes may be targeted through the use of high- or hyper- 

25 functional siRNA. High- or hyper- functional siRNA that exhibit increased potency, 
require lower concentrations to induce desired phenotypic (and thus therapeutic) 
effects. This circumvents RISC saturation. Itthereforereasonsthat if lower 
concentrations of a single siRNA are needed for knockout or knockdown expression 
of one gene, then the remaining (uncomplexed) RISC will be jfree and available to 

30 interact with siRNA directed against two, three, four, or more, genes. Thus in this 
embodiment, the authors describe the use of highly functional or hyper-functional 
siRNA to knock out three separate genes. More preferably, such reagents could be 
combined to knockout four distinct genes. Even more preferably, highly functional or 
hyperfunctional siRNA could be used to knock out five distinct genes. Most 
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preferably, siRNA of this type could be used to knockout or knockdown the 
expression of six or more genes. 

Hvpei-fiinctional siRNA 
5 The term hyperfunctional siRNA (hf-siRNA) describes a subset of the siRNA 

population that induces RNAi in cells at low- or sub-nanomolar concentrations for 
extended periods of time. These traits, heightened potency and extended longevity of 
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents 
having higher potency require lesser amounts of the molecule to achieve the desired 
10 physiological response, thus reducing the probability of side effects due to "off- 
target" interference. In addition to the potential therapeutic benefits associated with 
hyperfimctional siRNA, hf-siRNA are also desirable'from an economic position. 
Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing the 
overall expenditures to both the manufacturer and the consumer. 

1 5 Identification of hyperfunctional siRNA involves multiple steps that are 

designed to examine an individual siRNA agent's concentration- and/or longevity- 
profiles. ]n one non-limiting example, a population of siRNA directed against a single 
gene are first analyzed using the previously described algorithm (Formula VIII). 
Individual siRNA are then introduced into a test cell line and assessed for the ability 

20 to degrade the target mRNA, It is important to note that when performing this step it 
is not necessary to test all of flie siRNA. Instead, it is sufficient to test only those 
siRNA having the highest SMARTscores™ (i.e. SMARTscore™ > -1 0). 
Subsequently, the gene silencing data is plotted against the SMARTscores™ (see 
Figure 9). SiRNA that (1) induce a high degree of g«ie silencing (i.e. they induce 

25 greater than 80% gene knockdown) and (2) have supoior SMARTscores™ (i.e. a 
SMARTscore™ of > -10, suggesting a desirable average intemal stability profile) are 
selected for further investigations designed to better understand the molecule's 
potency and longevity. In one, non-limiting study dedicated to understanding a 
molecule's potency, an siRNA is introduced into one (or more) cell types in 

30 increasingly diminishing concentrations {e.g. 3.0 0.3 nM). Subsequently, the level 
of gene silencing induced by each concentration is examined and siRNA that exhibit 
hyperfimctional potemcy (i.e. those that induce 80% alencing or greater at e.g. 
picomolar concentrations) are identified. In a second study, the longevity profiles of 



siRNA having high (>-l 0) SMARTscores™ and greater than 80% silencing are 
examined, hi one non-Hmiting example of how this is achieved, siRNA are introduced 
into a test cell line and the levels of RNAi are measured over an extended period of 
time (e.^. 24-1 68 hrs). SiRNAs that exhibit strong RNA interference patterns (z.e. 
5 >80 % interference) for periods of time greater than, e,g., 1 20 hours are thus 

identified. Studies similar to those described above can be performed on any and all 
of the >10^ siRNA included in this document to further define the most fimctional 
molecule for any given gene. Molecules possessing one or both properties (extended 
longevity and heightened potency) are labeled "hyperfimctional siRNA," and 

1 0 earmarked as candidates for future therapeutic studies- 
While the example(s) given above describe one means by which 
hyperfunctional siRNA can be isolated, neither the assays themselves nor the 
selection parameters used are rigid and can vary with each family of siRNA. Families 

1 5 of siRNA include siRNAs directed against a single gene, or directed against a related 
family of genes. 

The highest quality siRNA achievable for any given gene may vary 
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies 

20 such as those described above may enable the identification of an siRNA that, at 
picomolar concentrations, induces 99"^% silencing for a period of 10 days. Yet 
identical studies of a second gene (gene Y) may yield an siRNA that at high 
nanomolar concentrations (e.g. lOOnM) induces only 75% silencing. foiua^eriod of 2 
days. Both molecules represent the very optimum siRNA for their respective gene 

25 targets and therefore are designated "hyperfunctional." Yet due to a variety of factors 
including but not limited to target concentration, siRNA stabiUty, cell type, off-target 
interference, and others, equivalent levels of potency and longevity are not 
achievable. Thus, for these reasons, the parameters described in the before mentioned 
assays, can vary. While the initial screen selected siRNA that had SMARTscores™ 

30 above -1 0 and a gene silencing capability of greater than 80%, selections that have 
stronger (or weaker) parameters can be implemented. Similarly, in the subsequent 
studies designed to identify molecules with high potency and longevity, the desired 
cutoff criteria (Le, the lowest concentration that induces a desirable level of 
interference, or the longest period of time that interference can be observed) can vary. 
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The experimentation subsequent to application of the rational criteria of this 
application is significantly reduced where one is trying to obtain a suitable 
hyperfunctional siRNA for, for example, therapeutic use. When, for example, the 
additional experimentation of the type described herein is applied by one skilled in the 
5 art with this disclosure in hand, a hyperfunctional siRNA is readily identified. 

The siRNA may be introduced into a cell by any method that is now known or 
that comes to be known and that firom reading this disclosure, persons skilled in the 
art would determine would be usefiU in connection with tihe present invention in 

10 enabling siRNA to cross the cellular membrane. These methods include, but are not 
limited to, any manner of transfection, such as for example transfection employing 
DEAE-Dextran, calcium phosphate, cationic lipids/liposomes; micelles, manipulation 
of pressure, microinjection, electroporation, inomunoporation, use of vectors such as 
viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the 

15 polynucleotides to specific conjugates or ligands such as antibodies, antigens, or 

receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake, 
and the like. 

Having described the invention with a degree of particularity, examples will 
20 now be provided. These examples are not intended to and should not be construed to 
limit the scope of the claims in any way. 

Examples .r-^^ ^ 

25 General Techniques and Nomenclatures 

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first 
nucleotide of the 5 '-end of the sense strand is position 1, which corresponds to 
position 19 of the antisense strand for a 19-mer. In most cases, to compare results 
firom different experiments, silencing was detemiined by measuring specific transcript 
30 mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours 
post-transfection, with siRNA concentrations held constant at 100 nM. For all 
experiments, unelss otherwise specified transfection efficiency was ensured to be over 
95%, and no detectable cellular toxicity was observed. The following system of 
nomenclature was used to compare and report siRNA-silencing functionality: "F" 



DO 

followed by the degree of minimal knockdown. For example, F50 signifies at least 
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 
siRNAs were considered non-functional. 

Cell culture and transfection. 96-well plates are coated with 50 fil of 50 mg/ml poly- 
L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried 
for 20 min. HEK293 cells or HEK293Liics or any other cell type of interest are 
released fi^om their solid support by trypsinization, diluted to 3.5 X 10^ cells/ml, 
followed by the addition of 100 ^iL of cells/well. Plates are then incubated overnight 
at 37° C, 5% CO2. Transfection procedures can vary Avidely depending on the cell 
type and transfection reagents, hi one non-limiting example, a transfection mixture 
consisting of 2 mL Opti-MEM I (Gibco-BRL), 80 ^il Lipofectamine 2000 
(Invitrogen), 1 5 |aL SUPERNasin at 20 U/jil (Ambion), and 1 .5 |il of reporter gene 
plasmid at 1 iig/ \il is prepared in 5-ml polystyrene round bottom tubes. 100 |il of 
transfection reagent is then combined with 100 \xl of siRNAs in polystyrene deep- well 
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 jit of Opti- 
MEM is then added to each well to bring the filial siRNA concentration to 100 nM. 
Plates are then sealed with parafihn and mixed. Media is removed firom HEK293 
cells and replaced with 95 jil of transfection mixture. Cells are incubated overnight at 
37^ C, 5% CO2. 

Quantification of gene knockdown. A variety of quantification procedures can be 
used to measure the level of silencing induced by siRNA or siRNA pools. In one non- 
limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene 
branched-DNA (bDNA) kits (Bayer) (Wang, et ah Regulation of insulin preRNA 
splicing by glucose. Proc Natl Acad Sci 1997, 94:4360.) are used according to 
manufacturer instractions. To measure luciferase activity, media is removed firom 
HEK293 cells 24 hrs post-transfection, and 50 ^il of Steady-GLO reagent (Promega) 
is added. After 5 min, plates are analyzed on a plate reader. 

Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNAs panels (Figure 5a, b) sorted 
according to using Formula VIII predicted values. All siRNAs scoring more than 0 



(formula VIII) and more then 20 (formula IX) are folly ftmctional. All ninety 
sequences for each gene (and DBI) appear below in Table III. 



TABLE m 



Cyclo 


1 


SEQ.ID0032 


GUUCCAAAAACAGUGGAUA 


Cyclo 


2 


SEQ. ID 0033 


UCCAAAAACAGUGGAUAAU 


Cyclo 


3 


SEQ. ID 0034 


CAAAAACAGUGGAUAAUUU 


Cyclo 


4 


SEQ. ED 0035 


AAAACAGUGGAUAAUUUUG 


Cyclo 


5 


SEQ. ID 0036 


AACAGUGGAUAAUUUUGUG 


Cyclo 


6 


SEQ. ID 0037 


CAGUGGAUAAUUUUGUGGC 


Cyclo 


7 


SEQ. ID 0038 


GUGGAUAAUUUUGUGGCCU 


Cyclo 


8 


SEQ. ID 0039 


GGAUAAUUUUGUGGCCUUA 


Cyclo 


9 


SEQ. ID 0040 


AUAAUUUUGUGGCCUUAGC 


Cyclo 


10 


SEQ. ID 0041 


AAUUUUGUGGCCUUAGCUA 


Cyclo 


11 


SEQ. ID 0042 


UUUUGUGGCCUUAGCUACA 


Cyclo 


12 


SEQ. ID 0043 


UUGUGGCCUUAGCUACAGG 


Cyclo 


13 


SEQ. ID 0044 


GUGGCCUUAGCUACAGGAG 


Cyclo 


14 


SEQ. ID 0045 


GGCCUUAGCUACAGGAGAG 


Cyclo 


15 


SEQ. ID 0046 


CCUUAGCUACAGGAGAGAA 


Cyclo 


16 


SEQ. ID 0047 


UUAGCUACAGGAGAGAAAG 


Cyclo 


17 


SEQ. ID 0048 


AGCUACAGGAGAGAAAGGA 


Cyclo 


18 


SEQ. ID 0049 


CUACAGGAGAGAAAGGAUU 


Cyclo 


19 


SEQ. ID 0050 


ACAGGAGAGAAAGGAUUUG 


Cyclo 


20 


SEQ. ID 0051 


AGGAGAGAAAGGAUUUGGC 


Cyclo 


21 


SEQ. ID 0052 


GAGAGAAAGGAUUUGGCUA 


Cyclo 


22 


SEQ. ID 0053 


GAGAAAGGAUUUGGCUACA 


Cyclo 


23 


SEQ. ID 0054 


GAAAGGAUUUGGCUACAAA 


Cyclo 


24 


SEQ. ID 0055 


AAGGAUUUGGCUACAAAAA 


Cyclo 


25 


SEQ. ID 0056 


GGAUUUGGCUACAAAAACA 


Cyclo 


26 


SEQ. ID 0057 


AUUUGGCUACAAAAACAGC 


Cyclo 


27 


SEQ. ID 0058 


UUGGCUACAAAAACAGCAA 


Cyclo 


28 


SEQ. ID 0059 


GGCUACAAAAACAGCAAAU 


Cyclo 


29 


SEQ. ID 0060 


CUACAAAAACAGCAAAUUC 


Cyclo 


30 


SEQ. ED 0061 


ACAAAAACAGCAAAUUCCA 


Cyclo 


31 


SEQ. ID 0062 


AAAAACAGCAAAUUCCAUC 


Cyclo 


32 


SEQ. ID 0063 


AAACAGCAAAUUCCAUCGU 


Cyclo 


33 


SEQ. ID 0064 


ACAGCAAAUUCCAUCGUGU 


Cyclo 


34 


SEQ. ID 0065 


AGCAAAUUCCAUCGUGUAA 
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Cyclo 


35 


SEQ. ID 0066 




Cyclo 


36 


SEQ. ID 0067 


AAl IT TPr ATTPfrT irsi I A AT TP A A 


Cyclo 


37 


SEQ. ID 0068 


UUCCATTrGIIfrTTA AITP A Anr: 


Cyclo 


38 


SEQ. ID 0069 


cr AT irni tgt t a a t tp a a a p 


Cyclo 


39 


SEQ. ID 0070 


AUCGT IGT TA At IP A Ann aptti t 


Cyclo 


40 


SEQ. ID 0071 


PGT IGT T A AI TP A A nn A Pin T/^ a 


Cyclo 


41 


SEQ. ID 0072 


T Jfil T A AITP A Ann A PI TT TP ATTn 


Cyclo 


42 


SEQ. ID 0073 


IJA AT TP A Ann A PT Tl TP AT Tn A f T 


Cyclo 


43 


SEQ. ID 0074 


AT TP A AGO A PT IT TP A T in A T TPr» 


Cyclo 


44 


SEO. ID 0075 


P A Ann APT TT TP ATTO AT TPr* A ri 


Cyclo 


45 


SEQ. ID 0076 


AffG A PT IT TP ATTG A T TPP A nnn 


Cyclo 


46 


SEO. ID 0077 


G APT Tf TP AT TG AT TPP A nnnnn 


Cyclo 


47 


SEQ. ID 0078 


PT TT TP AT TG AT TPP A nnnnnn a 


Cyclo 


48 


SEO ID 0079 


T IP AT TG A T TPP A nnnnnn An a 

KJ K^rK kJ \ j/1l\J WW AVJ vJvJL^VJVJ AIJ a 


Cyclo 


49 


SEQ. ID 0080 


AT IG AT TPP A nnnnnn An Ami 


Cyclo 


50 


SEQ. ID 0081 


G AT TPP A GGGPnr* A ri A r'T TT Tr* 


Cyclo 


51 


SEO ID 0082 


T TPP A nrmpnn A n A rT TT Tr» A 

uv^V-- Avjovjv^O\jAvjAC.|J U VxAO 


Cyclo 


52 


SEQ. ID 0083 


CAGGGCGGAGACUUCACCA 


Cyclo 


53 


SEQ. ID 0084 


GGGCGGAGACUUCACCAGG 


Cyclo 


54 


SEO ED 0085 


nnnn a rj a pt it tp a nn a nnnn 


Cyclo 


55 


SEO ID 0086 


nn An A pt tt m a nn a n.nnn a n 


Cyclo 


56 


SEO ID 0087 


AG APT TT TP A PP A r^rsri/^ A ri AT T 
/\vj/\\^u uv^Av^v^AvjOOvjAvjAU 


Cyclo 


57 


SEO ID 0088 


V APTTTTPAPP A nnr^n A r: AT Tr:ri 
/\w yj KJ *^Aww AOvJVJij Avj AU vjLr 


Cyclo 


58 


SEO ID 0089 


T TT TP A PP A nn/^rj A h.^m,nn a 
U Ul^ Al-^\^AOOOOAO AUOUCA 


Cyclo 


59 


SEO ID 0090 


p A PP A nnnn a n a t mrsp a p a 


Cyclo 


60 


SEO ID 0091 


PP A GGGrs A n AT Tnr^r' An a nn 

V-^ w/\vj VJ O VJ AO AU O VJ w Ai-/ AvJxJ 


Cyclo 


61 


SEO ID 0092 


Annnn An Ai mnn An Ann An 
AUOvJOAOAUVJviCAL^AvjvrAvj 


Cyclo 


62 


SEO ID 0093 


GGG AG ATTGfiP A PAnr^ A r^n A 


Cyclo 


63 


SEO ID 0094 


GAGATTGGPAPArsn Ann A A A 
OAOAwOOwAV-rAOOAOOAAA 


Cyclo 


64 


SEO ID 0095 


GATTGGPAPAGr; AnrjA A An A 

0/\V/OOV>AV.xAOOAOOAAAVJA 


Cyclo 


65 


SEO ID 0094 


TTGGPAPAGGAnnA A An Ann 

UOOV^/\\^/\OOAOOAAAOAVjC' 


Cyclo 


66 


SEO ID 0096 


GPAPAGGAGnA A An AnnATT 
OV'/W^AOOAOOAAAOAOOAU 


Cyclo 


67 


SEO ID 0097 


APAGGAGGA A AnAnnATTr*TT 

AwrvoO AOO AAAO AO U AU U 


Cyclo 


68 


SEO ID 0098 


AGGAGGA A AnAnp ATTmiAn 

/^OO/YOVJAAAOAOwAUv^UAL-- 


Cyclo 


69 


SEO ID 0099 


GAGG A A AG A np AT TPT T A nnn 

O/^OO/VfV/TLOAOl^AUL'U Av^-VJO ■ 


Cyclo 


70 


SEO. ID 0100 


GG A A AG AGP AT TPT T A pnnT in 

\j\jrvfTLrt.VJr\vJ w/\lj KJ AwOO UO 


Cyclo 


71 


SEO ID 0101 


A A AG AGP AT TPT T A PnnT Tn A n 

■"-«*"Vjjrvvj w^ w Vr w/\,wOO wO AO 




79 


oCl^. iXJ UiUZ 


AuAuCAUCUACGGUGAGCG 


Cyclo 


73 


SEQ. ID 0103 


AGCAUCUACGGUGAGCGCU 



Cyclo 


74 


^FO m 0104 


Cyclo 


75 


cpO moios 


Cyclo 


76 


SFO TT) OlOfi 


Cyclo 


11 


TTl 0107 

OxlV<. JUL/ uiU/ 


Cyclo 


78 


wn Tn 0108 

OJl\^. xXJ ViiUo 


Cyclo 


7Q 


Wn TTl 0100 


Cyclo 




^Fn Tn 0110 

OJCrV^, JUL/ Ui IV 


Cyclo 


O X 


Qpn Tn 01 1 1 


Cyclo 


87 


^Fn m 0110 


Cyclo 


O-/ 


wn m oi 1 1 


Cyclo 


84 


cpn m 01 Id 


Cyclo 


8S 


cpo rn oi 1 <c 

OXJV^. XXJ Ul 13 


Cvclo 


8^ 


m oi 1 ^ 

ijJD\^. LL/ Ul 10 


Cyclo 


87 


Qpn Tn 01 T7 


Cyclo 


88 


*\Fn Tn 01 ifi 

OX^/Vc . 11,1 U 1 1 0 


Cyclo 


80 


^FO Tn 01 10 
onv^, 11/ ul ly 


Cyclo 


on 


Tr\ m Oft 
OXIV^. 11/ UIZU 


DB 


1 


<?Fn Tn oioi 

oliV . 11/ U 1 Z 1 


DB 




*\Fn rn oi oo 

oHv^. JUL' uizz 


DB 


J 


cpo tn 01 0*^ 


DB 


A 

*T 


Tr\ m o/i 
oiiv<f . 11^ u i z*t 


DB 


s 


QFn Tn n 1 0 ^ 

OCV^. 11/ UlZj 


DB 


u 


^Fn tn OI 0^ 

oCV^. i±J U i ZD 


DB 


7 


Qpn m 01 07 

OXZ-V^. IL/ U 1 Z / 


DB 


o 


oliv^. Ill UiZo 


DB 


0 


QFO Tn Al OQ 
oriv^. 11/ UiZy 


DB 


10 




DB 


1 1 

1 i 


cpr\ Tnoiii 


DB 


19 


<?T?n m Alio 
oliV^. JX/ Ul jZ 


DB 


u 


cpo inoi'^'i 

ocv^, 11/ Ul^J 


DB 


Id 


cpo Tnoiid 

oCV^. 11^ Ul j*l 


DB 


1 S 


oCl^. ULI UUD 


DB 




cpo Tn O1 1^ 

oCv^. iXJ Ul^O 


DB 


17 


Tnoi'i7 


DB 


18 


Qpo m oin^ 

OCy^. xU UlJo 


DB 


10 
xy 


QFn m oi io 
ocv^. 11/ ui^y 


DB 




QFn tnoidn 

oCV^. 11/ UlnU 


DB 


21 


cpo Tn oidi 

OC\l, MU U1*H 


iJt> 


JLl 


SEQ. ID 0142 


DB 


23 


SEQ.ID0143 



CAUCUACGGUGAGCGCUUC 

UCUACGGUGAGCGCUUCCC 

UACGGUGAGCGCUUCCCCG 

CGGUGAGCGCUUCCCCGAU 

GUGAGCGCTJUCCCCGAUGA 

GAGCGCUUCCCCGAUGAGA 

GCGCUUCCCCGAUGAGAAC 

GCUUCCCCGAUGAGAACUU 

UUCCCCGAUGAGAACUUCA 

CCCCGAUGAGAACUUCAAA 

CCGAUGAGAACUUCAAACU 

GAUGAGAACUUCAAACUGA 

UGAGAACUUCAAACUGAAG 

AGAACUUCAAACUGAAGCA 

AACUUCAAACUGAAGCACU 

CUUCAAACUGAAGCACUAC 

UCAAACUGAAGCACUACGG 

ACGGGCAAGGCCAAGUGGG 

CGGGCAAGGCCAAGUGGGA 

GGGCAAGGCCAAGUGGGAU 

GGCAAGGCCAAGUGGGAUG 

GCAAGGCCAAGUGGGAUGC 

CAAGGCCAAGUGGGAUGCC 

AAGGCCAAGUGGGAUGCCU 

AGGCCAAGUGGGAUGCCUG 

GGCCAAGUGGGAUGCCUGG 

GCCAAGUGGGAUGCCUGGA 

CCAAGUGGGAUGCCUGGAA 

CAAGUGGGAUGCCUGGAAU 

AAGUGGGAUGCCUGGAAUG 

AGUGGGAUGCCUGGAAUGA 

GUGGGAUGCCUGGAAUGAG 

UGGGAUGCCUGGAAUGAGC 

GGGAUGCCUGGAAUGAGCU 

GGAUGCCUGGAAUGAGCUG 

GAUGCCUGGAAUGAGCUGA 

AUGCCUGGAAUGAGCUGAA 

UGCCUGGAAUGAGCUGAAA 

GCCUGGAAUGAGCUGAAAG 

CCUGGAAUGAGCUGAAAGG 



DB 


24 


SEO E)0144 


r^T irjr^ a a t a nr^t a a a r^r^i-\ 
L^UOOAAUOAOCUOAAAGGG 


DB 


25 


SEO ID 0145 


U O u AAUCi AOCU (j AAAG G G A 


DB 


26 


SEO ID 0146 


Oljr UU AVjU U(j AAAuuG AC 


DB 


27 


SEO ID 0147 


OAAUOAOI-^UIIjAAAvjCjGACU 


DB 


28 


SEO ID 0148 


AA UO AOCUO AAAOOu ACUU 


DB 


29 


SEO ID 0140 


Alio AOr^TT/^ A A Ar^r^r* a/^tttt^ 
AU ur AOU UoAAACjCjCj AC UUC 


DB 


30 


SEO ID 0150 


UU Au^^ UIjAAAoOO ACUUCC 


DB 


31 


SEO ID 0151 


vj Au\^ U ij AAAOCjCj ACUUCC A 


DB 


32 


SEO ID 01 57 


A rZf^l 1/^ AAA f^r^r^ A TT tr^r^ a a 

AUC UO AAACjGGACUUCCAA 


DB 


33 


SEO ID015'^ 


Uu AAAOoIj ACUUCCAAG 


DB 


34 


SEO ID 0154 


n TO AAA rzi^n a /^tti t/^o a a /^/^ 
^^UvTAAAuijGAuU UCCAAGG 


DB 


35 


SEO ID 0155 


T TO AAA nf^r^ A /^f T¥ T/^O A A /^/^ A 

UUAAAGGOACUUCCAAGGA 


DB 


36 


SFO ID 01 56 


O AAA A /"TTT T/^i^ A A /^/~« A A 

UAAAuGGACUUCCAAGGAA 


DB 


37 


SFO m 01 S7 


AAA f^f~^i~^ A /^T Tf T#^/^ A A /~ty^ A A 

AAAuuuACUUCCAAGGAAG 


DB 


38 


SEO ID 01 58 


A A i^f^f^ A Tf T/^O A A A A A 

aauvjO AOU UCCAAGG AAGA 


DB 


39 


SEO ID 01 50 

wX^%^< XL^ \J 


A f^f^i^ A /^T TT T/^O A A A A A ¥ t 

AuOU ACU UCCAAGG AAG AU 


DB 


40 


SFO ID 01 60 


ouu ACU UCCAAGGAAGAUG 


DB 


41 


SFO ID 0161 


i~H^ A /~*T TT T/^/^ A A A A A T tr%/^ 

UuAC U UCC AAGG AAG AUGC 


DB 


42 


SFO TO 01 69 

OX.<V^. JLL/ V/1UZ> 


UAC U UOCAAGGAAG AUGCC 


DB 


43 


SFO Tr>016'^ 


A TT T/^i^ A A /^/^ A A A T T/^/^rf^ A 

ACU UCCAAGGAAGAUGCCA 


DB 


44 


SFO TDOIM 


C U u AAGG AAG AUGCCAU 


DB 


45 


SFO TnOlfiS 


T TT T/^/^ A A /^/"^ A A A TT^^y^ A T f 1^ 

U UCCAAGGAAGAUGCCAUG 


DB 


46 


SFO rnoififi 


UCCAAO u AAG AUGCC AUG A 


DB 


47 


SFO rr>oif>7 

OXiV^« XLj' V/IU/ 


C^r^ A A f^i~^ A A A T T/^ /~ty^ A T Xfy A A 

CCAAOOAAGAUGCCAUGAA 


DB 


4R 


SFO IT) 01^18 

OXIiW. XL/ VlOO 


A A A A A T TA~« A ¥ T/~i AAA 

UAAouAAGAUGCCAUGAAA 


DB 


49 


SFO ID 01 f\0 


A A /^/~* A A A T ir^/^e~y a t t/^ a a a ^ 

AAuOAAOAUGCCAUGAAAG 


DB 


50 


SFO rnoiTo 


A A A A T T/~>/~»/^ A T A A A /-I 

AUG AAG AUGCC AUG AAAG C 


DB 


51 


SFO in 0171 

OL^V^. XL/ l/l / Jl 


i^Cl A A A T T/^/^/~« A T T/~» A A A T 

UGAAOAUGCCAUGAAAGCU 


DB 


52 


SFO in 0179 


A A A T T/^ A T T/^ AAA /~l TT T 

UAAGAUGCCAUGAAAGCUU 


DB 


53 


SEO ID 017*^ 

k/-L^\^. XL/ yj L i D 


A A A T T/^ r^t^ A T T/^ AAA r~\t tt t a 

AAGAUGCCAUGAAAGCUUA 


DB 


54 


SEO ID 0174 

OX<\^. XL/ \/l /*T 


A A ¥ T/^/^rf^ A T T/^ AAA TT T A rf-l 

AGAUUCCAUGAAAGCUUAC 


DB 


55 


SEO ID 01 75 

OX.»\^» XL/ \J X 1 J 


O A T Jinf^i^ A T T/^ AAA /^/^T TT T A ^ A 

uAUGCCAUGAAAGCUUACA 


DB 


56 


SFO Tn017fi 

OXiV^. XL/ Vl / o 


A T tf^f^f^ A T T/^ AAA f% Tw T A ATT 

AUGCCAUGAAAGCUUACAU 


DB 


57 


SEO ID 01 77 

h/XrfX^» XL/ \Ml. / / 


T TO/^O A T T/^ AAA /~»rfTT T¥ 7 A A ¥ 

UUUCAUGAAAGCUUACAUC 


DB 


58 


SEO ID017R 

k^X^>^« XL/ Jl / O 


A T T/^ AAA ^^T TT T A ^ A ¥ Tr^ A 

uUCAUuAAAGCUUACAUCA 


DB 


59 


SEO ID 0170 

UX^V^* XL/ Vl /if 


C^i^ A T T/^ AAA ^ /"^ TT T A ^ A T T/^ A A 

CCAUOAAAGCUUACAUCAA 


DB 


60 


SEO IDOIRO 

h/X'X^. XL/ V/iOV 


A ¥ T/^ AAA i^fyj IT T A rf^ A T T^ A A 

L.AUOAAAGCUUACAUCAAC 


DB 


61 


SEO ID 01 81 

»>/*-»\^« XL/ \/ 1 0 1 


A T TO AAA O/^T Tf T A a ¥ t/^ a a a 

AUGAAAuCUUACAUCAACA 


XJl> 


fo 
oz 


oJcvj. ILl Ul o2 


UGAAAGCUUACAUCAACAA 


DB 


63 


SEQ.ID0183 


GAAAGCUUACAUCAACAAA 



bl 



DB 


64 


SEQ.ID0184 


AAAGPTTTTArATTPA APA a \n 


DB 


65 


SEQ.ID0185 


AAGPTJTTAPAITPA Ar*A A Ar'Ti 


DB 


66 


SEQ.ID0186 


AGCUUArAnrA APA A APTTA 


DB 


67 


SEQ. ID 0187 


GCUUAC AT IPAAPAAAniTAr' 


DB 


68 


SEQ. ID 0188 


CUUAPATTPA APA A APT T A A 


DB 


69 


SEQ. ID 0189 


UUACAUCAAPAA AGTTAP A A 


DB 


70 


SEQ. ID 0190 


UACATIPAAPA A AriTTAP A Ar- 


DB 


71 


SEQ. ID 0191 


ACAUCA AP A A AG! T An A A P A 


DB 


72 


SEQ. ID 0192 


CAUCAAPA A AGTTAGA A/^ AP 


DB 


73 


SEQ. ID 0193 


AUCAAPA A AGTTAnA An APP 


DB 


74 


SEQ. ID 0194 


UCAAC A A AGTTAGAAGA ppt t 


DB 


75 


SEQ. ID 0195 


CAACAA AGTTAGA AG APJPTTa 


DB 


76 


SEQ. ID 0196 


AACAAAGTTAGA ArJAPPTTA a 


DB 


77 


SEQ. ID 0197 


APAAAGTTAGA ArSAPiPTTA a a 


DB 


78 


SEQ. ID 0198 


PAAAGTTAGA Ar»APPTT A a 


DB 


79 


SEQ. ID 0199 


AAAGtlAGA AGAGPTTA A APA 


DB 


80 


SEQ. ID 0200 


AAGTTAGA AGAnPTTA A ap a a 
rxiwj \J t\\3£\£\\jijt\\j\^ UAAAO AlA 


DB 


81 


SEQ. ID 0201 


AGTTAGA AGAPSPTTA a ArsA a a 
txyj u r\Kjl\t\\jj\\j \_, UAAAO AAA 


DB 


82 


SEQ. ID 0202 


GIIAGA AGAGPITA A APA AAA 
VJ \J-rs\Ji\t^\jj\\j\^ UAAAO AAAA 


DB 


83 


SEQ. ID 0203 


UAGA AGAGPTTA A APA A A A a 
^ vj y-v/\vj /\vj v.^ U/\AAOAAAlAA 


DB 


84 


SEQ. ID 0204 


AGA AGAGPTTA A APA A A A a a 
v^vj /vrt.vj/\vj U AAAO AAAAAA 


DB 


85 


SEQ. ID 0205 


GAAG AGPTTA AAPAAaaaatt 


DB 


86 


SEQ. ID 0206 


AAGAGPTTA A APA AAAA atta 
>rt-^vj /\vj i-^ u/\/V/\0 AAAAAA U A 


DB 


87 


SEQ. ID 0207 


AGAGPTTA AAPAAAA A ATTAr* 
■tT-vj/^ VJ \^ U/\/\/V\j AAAAAA U AO 


DB 


88 


SEQ. ID 0208 


GAGPTTA A APA AAAA ATT A or' 
vj/^vjV' tJ/\/\/\o/\/\AAAAU AOO 


DB 


89 


SEQ. ID 0209 


AGPTTA A AG A AAAA ATT APPP 


DB 


90 


SEQ. ID 0210 


GPTTA AAGAAAAA ATTA PP/^i^ 
VJ v>» /T-rtL/\vj/\/v/\AAA U AOOOO 


Luc 


1 


SEQ. ID 0211 


AITPPTTPATTA A APPPP A AP A 
-rvuv^v^vJV^/^U/VAAOOOOAAOA • 


Luc 


2 


SEQ. ID 0212 


AGATTPPTTP ATTA A APPPP A A 
^vjr\ u \^ U v-x A U AAAOO OO AA 


Luc 


3 


SEQ. ID 0213 


AGAGATTPPTTPATT A A 
^\jfwjiwj\^\^ U u AAAO O OO 


Luc 


4 


SEQ. ID 0214 


AGAG AGATTPPTTP ATTA A a nr^ 
-'^vjvavj/iLVJ/itU 1.^1-/ UO A U AAAO O 


Luc 


5 


SEQ. ID 0215 


TJCAGAGAGATTPPITPATTA A A 

\> v-'rtvj/\vj/\vi/\u L^O U OAU AAA 


Luc 


6 


SEQ. ID 0216 


AAITPAGAGAP ATTPPT TP ATT A 

-'w VJ v.-'/wj/WJ AO AUOOU OA U A 


Luc 


7 


SEQ. ID 0217 


AAA ATTP AG ArJ A P A T TPPi Tr* A 
^vwvrt.uv.^AVJAVJAOAUOOUOA 


Luc 


8 


SEQ. ID 0218 


GAA A A AT TP AG AP A P ATTPr«i i 
vjjvi-ft^ftuA w V/ AVJAvj AO A U OO U 


Luc 


9 


SEQ. ID 0219 


AAGA A AA ATTPAPap AP ATT/^ 
'vxvj.ft„f\/\/\/\u v.^ AO AO AO A uO 


Luc 


10 


SEQ. ID 0220 


GPA AGA A AA ATTPAPap A/^ a 

VJV^,ft«AOAAAAAUv.^AOAOAOA 


Luc 


11 


SEQ. ID 0221 


ACGP A AG A A AAATTPAPAr»A 
v.* AAVj/VAAAA UO AO AO A 


Luc 


12 


SEQ. ED 0222 


PG A f^ClC* AAPAAAAAT ir^ a ^ a 
V^OAl^OOAAOAAAAAUCAGA 


Luc 


13 


SEQ. ID 0223 


CUCGACGCAAGAAAAAUCA 



T 11/* 
JUUC 


1 A 

14 


bJbQ. ID 0224 


Luc 


1 < 

ID 


bbQ. ID 0225 


Luc 


Xo 


hrx)^, ID 0226 


Luc 


i 1 


btv?. ID 0227 


Luc 




bty. ID 0228 


Luc 


1 C% 


C"C/^ TT% AOOA 

iSiil^. ID 0229 


Luc 


on 


TVS AOIA 

oJbvi- AD Oz30 


Luc 




obQ. ID 0231 


Luc 


L2. 


oiiQ. ID 0232 


Luc 


2.5 


btQ. ID 0233 


Luc 




orii^. ID 0234 


Luc 


ZD 


artV^. ID 0235 


Luc 


zo 


oiiQ- 0236 


Luc 


z/ 


oxiQ. ID 0237 


Luc 




SEQ. ID 0238 


Luc 


Oft 

29 


SEQ. ID 0239 


Luc 


30 


SEQ. ID 0240 


Luc 


31 


SEQ. ID 0241 


Luc 


^o 
3Z 


SEQ. ID 0242 


Luc 


33 


SEQ. ID 0243 


Luc 


34 


SEQ. ID 0244 


Luc 


3!) 


SEQ. ID 0245 


Luc 


3o 


SEQ. ID 0246 


Luc 


3 / 


SEQ. ID 0247 


Luc 


3i> 


SEQ. ID 0248 


Luc 


3y 


SEQ. ID 0249 


Luc 


4U 


SEQ. ID 0250 


Luc 


41 


SEQ. ID 0251 


Luc 


>to 
4Z 


SEQ. ID 0252 


Luc 


43 


SEQ. ID 0253 


Luc 


44 


SEQ. ID 0254 


Luc 


45 


SEQ. ID 0255 


Luc 


A£. 

4o 


SEQ. ID 0256 


Luc 


4/ 


SEQ. ID 0257 


Luc 


4o 


SEQ. ID 0258 


Luc 


AC% 

49 


SEQ. ID 0259 


Luc 




OT?/~V TTX /\0^/\ 

SEQ. ID 0260 


Luc 


CI 

Dl 


SEQ. ID 0261 


Luc 


52 


SEQ. ID 0262 


Luc 


53 


SEQ. ID 0263 



AACUCGACGCAAGAAAAAU 

AAAACUCGACGCAAGAAAA 

GGAAAACUCGACGCAAGAA 

CCGGAAAACUCGACGCAAG 

UACCGGAAAACUCGACGCA 

CUUACCGGAAAACUCGACG 

GUCUUACCGGAAAACUCGA 

AGGUCUUACCGGAAAACUC 

AAAGGUCUUACCGGAAAAC 

CGAAAGGUCUUACCGGAAA 

ACCGAAAGGUCUUACCGGA 

GUACCGAAAGGUCUUACCG 

AAGUACCGAAAGGUCUUAC 

CGAAGUACCGAAAGGUCUU 

GACGAAGUACCGAAAGGUC 

UGGACGAAGUACCGAAAGG 

UGUGGACGAAGUACCGAAA 

UUUGUGGACGAAGUACCGA 

UGUUUGUGGACGAAGUACC 

UGUGUUUGUGGACGAAGUA 

GUUGUGUUUGUGGACGAAG 

GAGUUGUGUUUGUGGACGA 

AGGAGUUGUGUUUGUGGAC 

GGAGGAGUUGUGUUUGUGG 

GCGGAGGAGUUGUGUUUGU 

GCGCGGAGGAGUUGUGUUU 

UUGCGCGGAGGAGUUGUGU 

AGUUGCGCGGAGGAGUUGU 

AAAGUUGCGCGGAGGAGUU 

AAAAAGUUGCGCGGAGGAG 

CGAAAAAGUUGCGCGGAGG 

CGCGAAAAAGUUGCGCGGA 

ACCGCGAAAAAGUUGCGCG 

CAACCGCGAAAAAGUUGCG 

AACAACCGCGAAAAAGXJUG 

GUAACAACCGCGAAAAAGU 

AAGUAACAACCGCGAAAAA 

UCAAGUAACAACCGCGAAA 

AGUCAAGUAACAACCGCGA 

CCAGUCAAGUAACAACCGC 



Luc 


54 




Luc 


55 




Luc 


56 


SEO 10 0266 


Luc 


57 


SEO ID 0267 


Luc 


58 


SEO Tn096R 


Luc 


59 


OXiV^. JUL/ VjC02^ 


Luc 


60 


SFO Tn 0970 


Luc 


61 


SEO rno97i 


Luc 


62 


SEO Tn0979 


Luc 


63 


SEO Tn097'? 


Luc 


64 


SEO TTJ 0974 


Luc 


65 


SEO rn 097^ 


Luc 


66 


SEO rn0976 


Luc 


67 


SEO rn*0977 


Luc 


68 


SEO in 09751 

0£fV^. JUL/ / O 


Luc 


6Q 


SEO TTI 0970 


Luc 


70 


SEO rn 0980 


Luc 


71 


SEO rn 0991 


Luc 


72 


SEO rn 0989 


Luc 


73 




Luc 


74 


SEO m 0984 


Luc 


7S 




Luc 


76 


^FO in 09 8 A 


Luc 


77 


QpO TT> 0989 


Luc 


78 


SEO m 0988 


Luc 


7Q 


SEO m 0980 


Luc 


80 


SEO in 09 QO 


Luc 


81 


Qpn Tn 0901 


Luc 


82 


SEO m O90O 


Luc 


83 


SEO 1TI090'^ 


Luc 


84 


SEO in 090d 


Luc 


85 


SEO in 090^ 


Luc 


86 


SEO in 0906 


Luc 


87 


SEO m 0907 

OxiV^. JJL/ Uity / 


Luc 


88 


SEO m 0908 


JUUC 


oy 


otiVj. ID 0299 


Luc 


90 


SEQ,ID0300 



CGCCAGUCAAGUAACAACC 

GUCGCCAGUCAAGUAACAA 

ACGUCGCCAGUCAAGUAAC 

UUACGUCGCCAGUCAAGUA 

GAUUACGUCGCCAGUCAAG 

UGGAUUACGUCGCCAGUCA 

CGUGGAUUACGUCGCCAGU 

AUCGUGGAUUACGUCGCCA 

AGAUCGUGGAUUACGUCGC 

AGAGAUCGUGGAUUACGUC 

AAAGAGAUCGUGGAUUACG 

AAAAAGAGAUCGUGGAUUA 

GGAAAAAGAGAUCGUGGAU 

ACGGAAAAAGAGAUCXJUGG 

UGACOGAAAAAGAGAUCGU 

GAUGACGGAAAAAGAGAUC 

ACGAUGACGGAAAAAGAGA 

AGACGAUGACGGAAAAAGA 

AAAGACGAUGACGGAAAAA 

GGAAAGACGAUGACGGAAA 

ACGGAAAGACGAUGACGGA 

GCACGGAAAGACGAUGACG 

GAGCACGGAAAGACGAUGA 

UGGAGCACGGAAAGACGAU 

UUUGGAGCACGGAAAGACG 

GUUUUGGAGCACGGAAAGA 

UUGUUUUGGAGCACGGAAA 

UGUUGUUUUGGAGCACGGA 

GUUGUUGUUUUGGAGCACG 

CCGUUGUUGUUUUGGAGCA 

CGCCGUUGUUGUUUUGGAG 

GCCGCCGUUGUUGUUUUGG 

CCGCCXjCCGUUGUUGUUUU 

UCCCGCCGCCGUUGUUGUU 

cuucccgccgccguuguug 
aacuucccgcxjgccguugu 
ugaacuucccgccgccguu 



Example II. Validation of the Algorithm using DBI, Luciferase, PLK, EGFR, 
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes, human DBI, 
firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human secreted 
5 alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of 
their SMARTscores™ derived by analysis of their sequence using Formula VIII (all 
of the siRNAs would be selected with Formula EX as well) and analyzed for their 
ability to silence their targets' expression. In addition to the scoring, a BLAST search 
was conducted for each siRNA. To minimize the potential for off-target silencing 

1 0 effects, only those target sequences with more fhaa three mismatches against un- 
related sequences were selected. Semizarov, et al, Specificity of short interfering UNA 
determined through gene expression signatures. Proc. Natl. Acad. Sd. U.S.A. 2003, 
100:6347. These duplexes were analyzed individually and in pools of 4 and 
compared with several siRNAs that were randomly selected. The functionality was 

1 5 measured a percentage of targeted gene knockdown as compared to controls. All 

siRNAs were transfected as described by the methods above at 100 nM concentration 
into HEK293 using Lipofectamine 2000. The level of the targeted gene expression 
was evaluated by B-DNA as described above and normalized to the non-specific 
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed herein 

20 were significantly more potent than randomly selected siRNAs. The algorithm 
increased the chances of identifying an F50 siRNA firom 48% to 91%, and an F80 
siRNA fi-om 13% to 57%. In addition, pools of SMART siRNA silence the selected 
target better than randomly selected pools (see Figure lOF). 

25 Example III. Validation of the Algorithm Using Genes Involved in Clathrin- 
Depmdent Endocytosis. 

Components of clathrin-mediated endocytosis pathway are key to modulating 
intracellular signaling and play important roles in disease. Chromosomal 
rearrangements that result in fusion transcripts between the Mixed-Lineage Leukemia 
30 gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene) are 
believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, 
as well as HIPl (Huntingtin-intoracting protein), genes that are believed to be 
involved in raidocytosis, are potentially responsible for aibnents resulting in lipid 
storage, and neuronal diseases, respectively. For these reasons, siRNA directed 



against clathrin and other genes involved in the clathrin-mediated endocytotic 
pathway are potentially important research and therapeutic tools. 

siRNAs directed against genes involved in flie clathrin-mediated endocytosis 
5 pathways were selected using Formula VIII. The targeted genes were clathrin heavy 
chain (CHC, accession # NM_004859), clathrin light chain A (CLCa, NM_001833), 
clathrin light chain B (CLCb, NM_001834), CALM (U45976), p2 subunit of AP-2 
(P2,NM_001282), Epsl5 (NM_001981), EpslSR (NM_021235), dynamin II 
(DYNII, NM_004945), RabSa (BC001267), RabSb (NM_002868), Rab5c 
10 (AF141304), and EEA.1 (XM_018197). 

For each gene, four siRNAs duplexes with the highest scores were selected 
and a BLAST search was conducted for eadh of them using the Human EST dateibase. 
In order to minimize the potential for off-target silencing effects, only those 
1 5 sequences with more than three mismatches against un-related sequences were used. 
All duplexes were synthesized at Dharmacon, Inc. as 21-mers with 3'-UU overhangs 
using a modified method of 2'-ACE chemistry Scaringe, Advanced 5'-sifyl-2'- 
orthoester approach to UNA oligonucleotide synthesis. Methods Enzymol 2000, 317:3 
and the antisense strand was chemically phosphorylated to insure maximized activity. 

20 

HeLa cells were grown in Dulbecco's modified Ease's medium (DMEM) 
containing 10% fetal bovine serum, antibiotics and gjutamine. siRNA duplexes were 
resuspended in IX siRNA Universal buffer (Dharmacon, Inc.) to lO^M prior to 
transfection. HeLa cells in 12-well plates were transfected twice with 4^1 of 20m.M 

25 siRNA duplex in 3^1 Lipofectamine 2000 reagent (hivitrogen, Carlsbad, California, 
USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were 
included, the amount of each duplex was decreased, so that the total amount was the 
same as in transfections with single siRNAs. Cells were plated into normal culture 
medium 1 2 hours prior to experiments, and protein levels were measured 2 or 4 days 

30 after the first transfection. 



Equal amounts of lysates were resolved by electrophoresis, blotted, and 
stained with the antibody specific to targeted protein, as well as antibodies specific to 



unrelated proteins, PPl phosphatase and TsglOl (not shown). The cells were lysed in 
Triton X-lOO/glycerol solubilization buffer as described previously. Tebar, 
Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) 
Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the 
5 Impact of Overexpression on Clathrin-mediated Traffic^ MoL Biol. Cell Aug 1999, 
10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, 
and Westem blotting was performed with several antibodies followed by detection 
using enhanced chemiluminescence system (Pierce, Inc). Several x-ray films were 
analyzed to determine the linear range of the chemiluminescence signals, and the 
1 0 quantifications were performed using densitometry and Alphalmager v5.5 software 
(Alpha Innotech Corporation). In experiments with Epsl 5R-targeted siRNAs, cell 
lysates were subjected to hranunoprecipitation with Ab860, and EpslSR was detected 
in immunoprecipitates by Westem blotting as described above. 

1 5 The antibodies to assess the levels of each protein by Westem blot were 

obtained firom the following sources: monoclonal antibody to clathrin heavy chain 
(TD.l) was obtained fi-om American Type Culture Collection (Rockville, MD, USA); 
polyclonal antibody to dynamin II was obtained firom Affinity Bioreagents, Inc. 
(Golden, CO, USA); monoclonal antibodies to EEA.1 and RabSa were purchased 

20 firom BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal 

antibody to TsglOl was purchased firom Santa Cruz Biotechnology, Inc. (Santa Craz, 
CA, USA); the monoclonal antibody to GFP was fi^om ZYMED Laboratories Inc. 
(South San Francisco^ CA^-USA); the rabbit polyclonal antibodies Ab32 specific to a- 
adaptins and Ab20 to CALM were described previously Soricin, et al, Stoichiometric 

25 Interaction of the Epidermal Growth Factor Receptor with the Clathrin-associated 
Protein Complex AP-2, J, Biol. Chem. Jan 1995, 270:619, the polyclonal antibodies to 
clathrin Hght chains A and B were kindly provided by Dr. F. Brodsky (UCSF); 
monoclonal antibodies to PPl (BD Transduction Laboratories) and a-Actinin 
(Chemicon) were kindly provided by Dr. M. DeirAcqua (University of Colorado); 

30 Epsl 5 Ab577 and Epsl5R Ab860 were kindly provided by Dr. P.P. Di Fiore 
(European Cancer Institute). 



67 

Figure 11 demonstrates the in vivo functionality of 48 individual siRNAs, 
selected using Formula VIII (most of them will meet the criteria incorporated by 
Formula DC as well) targeting 12 genes. Various cell lines were transfected with 
siRNA duplexes {Dupl-4) or pools of siRNA duplexes (Pool), and the cells were 
5 lysed 3 days after transfection with the exception of CALM (2 days) and P2 (4 days). 

Note a pl-adaptin band (part of AP-1 Golgi adaptor complex) that runs 
sligjitly slower than P2 adaptin. CALM has two splice variants, 66 and 72 kD. The 
full-length EpslSR (a doublet of -130 kD) and several truncated spliced forms of ~ 

10 1 00 kD and -70 kD were detected in Epsl 5R inmiunoprecipitates (shown by arrows). 
The cells were lysed 3 days after transfection. Equal amounts of lysates were 
resolved by electrophoresis and blotted with the antibody specific to a targeted protein 
(GFP antibody for YFP fusion proteins) and the antibody specific to unrelated 
proteins PPl phosphatase or a-actinin, and TSGlOl . The amount of protein in each 

1 5 specific band was normalized to the amount of non-specific proteins in each lane of 
the gel. Nearly all of them appear to be fimctional, which establishes that Fomula 
VIII and K can be used to predict siRNAs' functionality in general in a genome wide 
manner. 

20 To generate the fiision of yellow fluorescent protein (YFP) with RabSb or 

RabSc (YFP-Rab5b or YFP-Rab5c), a DNA firagment encoding the fiiU-length human 
RabSb or RabSc was obtained by FCR using Pfii polymerase (Stratagene) with a Sacl 
restriction site introduce ihtff^the 5' end and a Kpnl site into the 3' end and cloned 
into pEYFP-Cl vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and YFP- 

25 RabSa were described previously Tebar, Bohlander, & Sorkin, Clathrin Assembly 

Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, 
Interactiom with Clathrin, and the Impact of Overexpression on Clathrin-mediated 
Traffic, Mol. Biol. Cell Aug 1999, 10:2687. 

30 Example IIL Validation of the Algorithm Using Eg5, GADPH, ATEl , MEK2, 
MEKl, QB, LaminA/C, c-myc, human cyclophilin, and mouse cyclophiiin. 



05 

A number of genes have been identified as playing potentially important roles 
in disease etiology. Expression profiles of normal and diseased kidneys has 
implicated Edg5 in immxmoglobulin A neuropathy, a common renal glomerular 
disease. Mycl, MEKl/2 and other related kinases have been associated with one or 
more cancers, while lamins have been implicated in muscular dystrophy and other 
diseases. For these reasons, siRNA directed against the genes encoding these classes 
of molecules would be important research and therapeutic tools. 

Figure 12 illustrates four siRNAs targeting 10 different genes (Table V for 
sequence and accession number information) that were selected according to the 
Fomiula VIII and assayed as individuals and pools in HEK293 cells. The level of 
siRNA induced silencing was measured using the B-DNA assay. These studies 
demonstrated that thirty-six out of tihte forty individual SMART-selected siRNA tested 
are functional (90%) and all 10 pools are folly fimctional. 

Example V. Validation of the Algorithm Using Bcl2 

Bcl-2 is a '-25kD, 205-239 amino acid, anti-apoptotic protein that contains 
considerable homology with other members of the BCL family including BCLX, 
MCLl , BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which 
has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the 
hydrophobic tail) and is predominantly localized to the mitochondrial membrane. 
While Bcl2 expression is widely distributed, particular interest has focused on the 
expression of this molecule in B.and T cells. Bcl2 expression is down-regulated in 
nomial germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 
expression has been observed to be elevated. Cytological studies have identified a 
common translocation ((14;18)(q32;q32)) amongst a high percentage (>70%) of these 
lymphomas. This genetic lesion places the Bcl2 gene in juxtaposition to 
immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to 
enforce inappropriate levels of gene expression, and resistance to programmed cell 
death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 
promoter leads to enhanced expression and again, inhibition of apoptosis. hi addition 
to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis 
and various neurological diseases. 



ov 

The correlation between Bcl-2 translocation and cancer makes this gene an 
attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript 
(or Bcl2-IgH fusions) would further our understanding Bcl2 gene function and 
possibly provide a future therapeutic agent to battle diseases that result from altered 
5 expression or function of tiiis gene. 

In Silico Identification of Functional sIRNA 

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the 
sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed 
10 using the Formula VIII algorithm. As a result of these procedures, both the sequence 
and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according to their 
functionality. Subsequently, these sequences were BLAST'ed (database) to insure that 
the selected sequences were specific and contained minimal overlap with unrealated 
genes. The SMARTscoresfM for the top 10 Bcl-2 siRNA are identified in Figure 13. 

15 

In Vivo Testing of Bcl-2 SiRNA 

Bcl-2 siRNAs having the top ten SMARTscores™ were selected and tested in 
a functional assay to determine silencing efficiency. To accomplish this, each of the 
ten duplexes were synthesized using 2'-0-ACE chemistry and transfected at lOOnM 
20 concentrations into cells. Twenty-four hours later assays were performed on cell 
extracts to assess the degree of target silencing. Controls used in these experiments 
included mock transfected cells, and cells that were transfected with a non-specific 
siRNA duplex. >^ -^>r:^i^ 

25 The results of these experiments are presented below (and in Figure 14) and 

show that all ten of the selected siRNA induce 80% or better silencing of the Bcl2 
message at lOOnM concentrations. These data verify that the algorithm successfiiUy 
identified functional Bcl2 siRNA and provide a set of functional agents that can be 
used in experimental and therapeutic environments. 



siRNA 1 GGGAGAUAGUGAUGAAGUA 
siRNA2 GAAGUACAUCCAUUAUAAG 
siRNA 3 GUACGACAACCGGGAGAUA 
SiRNA 4 AGAUAGUGAUGAAGUACAU 
siRNA 5 UGAAGACUCUGCUCAGUUU 
siRNA 6 GCAUGCGGCCUCUGUUUGA 
siRNA? UGCGGCCUCUGUUUGAUUU 
siRNA 8 GAGAUAGUGAUGAAGUACA 
SiRNA 9 GGAGAUAGUGAUGAAGUAC 
siRNA 10 GAAGACUCUGCUCAGUUUG 



SEQ. ID NO. 301 
SEQ. ID NO. 302 
SEQ. ID NO. 303 
SEQ. ID NO. 304 
SEQ. ID NO. 305 
SEQ. ID NO. 306 
SEQ. ID NO. 307 
SEQ. ID NO. 308 
SEQ. ID NO. 309 
SEQ. ID NO. 310 



10 



Bcl2 siRNA: Sense Strand, 5'->3' 



Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Fonnulas (Algorithms) VIII and DC 
1 5 with their corresponding ranking, which have been evaluated for the silencing activity 
in vivo in the present study (Formula Vlll and DC, respectively). 



TABLE V 



Gene 


Accession 






Fonnula 


Fonnula 


Name 


Number 


SEQ. ID NO. 


FTIlSeqTence 


vm 


IX 


CLTC 


NM_004859 


SEQ. ID NO. 0301 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


cure 


NM_004859 


SEQ. ID NO. 0302 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


NM_004859 


SEQ. ID NO. 0303 


TGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM_004859 


SEQ.IDNO..0304 


GGAAATGGATCTCTITGAA 


54 


49.4 


CLTA 


NM_001833 


SEQ. ID NO. 0305 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NM_001833 


SEQ. ID NO. 0306 


AGACAGTTATGCAGCTATT 


4 


22.9 


CLTA 


NM_001833 


SEQ. ID NO. 0307 


CCAATTCTCGGAAGCAAGA 


1 


17 


CLTA 


NM_001833 


SEQ. ID NO. 0308 


GAAAGTAATGGTCCAACAG 


-1 


-13 


CLTB 


NM_001834 


SEQ. ID NO. 0309 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_001834 


SEQ. ID NO. 03 10 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NM_001834 


SEQ. ID NO. 03 11 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_001834 


SEQ. ED NO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ:EDN0.0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ. ID NO. 0314 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ. ID NO. 0315 


GGAAATGGAACCACTAAGA 


30 


61.2 


CALM 


U45976 


SEQ. ID NO. 03 16 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS15 


NM_001981 


SEQ. ID NO. 0317 


CCACCAAGATTTCATGATA 


48 


25.2 



71 



EPS 15 


NM_001981 


SEQ.1DNO.0318 


EPS 15 


NM_001981 


SEQ.IDNO.0319 


EPS15 


NM:_001981 


SEQ.IDNO.0320 


BPS15R 


NM_021235 


SEQ.IDNO.0321 


EPS15R 


NM_021235 


SEQ.IDNO.0322 


EPS15R 


NM_021235 


SEQ.IDNO.0323 


EPS15R 


NM_021235 


SEQ.IDNO.0324 


DNM2 


NM_004945 


SEQ.1DNO.0325 


DNM2 


NV[_004945 


SEQ.IDNO.0326 


DNM2 


NM_004945 


SEQ.IDNO.0327 


DNM2 


NM_004945 


SEQ.IDNO.0328 


ARF6 


AF93885 


SEQ.IDNO.0329 


ARF6 


AF93885 


SEQ.IDNO.0330 


ARF6 


AF93885 


SEQ.JPNO.0331 


ARF6 


AF93885 


SEQ.IDNO.0332 


RAB5A 


BC001267 


SEQ.IDNO. 0333 


RAB5A 


BC001267 


SEQ.1DNO.0334 


RAB5A 


BC001267 


SEQ.IDNO. 0335 


RAB5A 


BC001267 


SEQ.IDNO. 0336 


RAB5B 


NM_002868 


SEQ.IDNO. 0337 


RAB5B 


NM_002868 


SEQ.IDNO. 0338 


RAB5B 


NM_002868 


SEQ.IDNO. 0339 


RAB5B 


NM_002868 


SEQ.IDNO. 0340 


RAB5C 


AF141304 


SEQ.IDNO. 0341 


RAB5C 


AF141304 


SEQ.ro NO. 0342 


RAB5C 


AF141304 


SEQ.ro NO. 0343 


RAB5C 


AF141304 


SEQ.ro NO. 0344 


EEAl 


XM_0I8197 


SEQ.ro NO. 0345 


EEAl 


X^4_0^8197 


SEQ.ro NO. 0346 


EEAl 


XM_018197 


SEQ.ro NO. 0347 


EEAl 


XM_018197 


SEQ.ro NO. 0348 


AP2B1 


NM_001282 


SEQ.ro NO. 0349 


AP2B1 


NM_001282 


SEQ. ro NO. 0350 


AP2B1 


NM_001282 


SEQ.ro NO. 0351 


AP2B1 


NM_001282 


SEQ.ro NO. 0352 


PLK 


MM_005030 


SEQ.ro NO. 0353 


PLK 


NM_005030 


SEQ.ro NO. 0354 


PLK 


NM_005030 


SEQ.ro NO. 0355 


PLK 


NM_005030 


SEQ.ro NO. 0356 


GAPDH 


NM_002046 


SEQ.ro NO. 0357 



A uuo/\yvi^ 1 CCAACAAG A 


43 


49.3 


*^^nJ^y-^\J\Jr\\J\^ 1 Av^AO A 1 1 AT 


39 


11.5 


CCAPAPArjr' ATTr»TT'/^T' A A 


33 


-23.6 


vj.niT.\j 1 1 J 1 OAvjC AATC 


48 


33 


GClKCT^CiC\or*fiATnn a ^ a a 
vjvj-rvv^ 1 1 VJVJ^I^OA 1 CCACj AA 


27 


33 


vjv^/\i^i 1 uuAl ^vjACjATGAG 


20 


1.3 


CAAAfwAPPA A TTT^o r^/"»T^ A 


17 


27.7 


PPtr A ATP A A Ti^r^r^ a TrvT^/i 
^v^vj/\A 1 u AA 1 COCA rCTTC 


6 


-29.6 


OAPATn ATPPX/^r* A r^T^rx a 
1 OA 1 cc 1 IjOAuTTCA 


5 


-14 


vj/\vj\^OAAi CO 1 OALCACTT 


5 


24 


* v/^vj Aoc 1 OOCO I CTAC 


-4 


-63.6 


TC AP ATrjrJTT A A r'nrr'T' a a 
J. vjLr 1 1 AACC 1 CTAA 


27 


-21.1 


vj/\ 1 vrAOOOAOOCC ATAATC 


7 


-38.4 


PPTPT A A A A a a >T*r^nm A 

K^K^ 1 c X AAC 1 ACAAATCTTA 


4 


16.9 


vjvj/^/\uo 1 oo 1 A 1 oCAAAAT 


4 


11.5 


r?P A Ar3P A A nT^r^T> a a y-i a 

vj^^AAuoAAO 1 CCTAACATT 


40 


25.1 


vjvj/\AU AOO AO 1 AG ACCTTA 


17 


50.1 


AouAAi CAOTGTTGTAGTA 


16 


11.5 


P A A O A PP A A A /-lyviwn A ^ 

vji'^UAUO AO 1 AGACCTTAC 


12 


7 


P AAA PT'O a a /^/^yTnp/*^/-nT» a nw* 

vjAAAvj 1 CAAOCCTGGTATT 


14 


18.1 


-rt-rtAvj 1 CAAOCC^TGGTATTA 


6 


-17.8 


vjL^ 1 A X OAACO X GAATG ATC 


3 


-21.1 


v^/vft.oi^c X oo X A 1 rACGTTT 


-7 


-37.5 


PP A A P A K Tr^r^'Tr^ a a r-r*^ 

ovj AAC AAO A 1 C rOTC AATT 


38 


51.9. 


o^^AAX OAACOTGAACGAAA 


29 


43.7 


PA ATP A A r^i^'T/^ A A A A k rry 

1-.AA X OAACO 1 GAACG AAAT 


18 


43.3 


OP AP APP A i^r^r^i^'T a t*/^ a ^ * 

ovjAi^AOOAOCOGTATCACA 


6 


18.2 


APAP A/^ A/^/^TTTP/^ A A A <n A A 

AOACAOAOC ITGAGAATAA 


67 


64.1 


P APA AfS ATV^T^I'l' A 'vr^r^ a a a 

vjAVjAAOAX O XTTATGCAAA 


60 


48.7 


PA APAP A A ATr* A A /-^ A rri A 

vj/v^\*jAOAAAX CAGCAGATA 


58 


45.7 


PPA APTA A r^f^ A A A A A 

vjv^AAu 1 AAO 1 0 AACTAAC A 


56 


72.3 


P APPTA AT'rf'"*T/^o/*^ A A 

vji^vju X AA X o 1 OCCAC ATTG 


49 


-12.4 


PPAPATn A/^TTT* A /^T^ A A A A 

vjv^ AO A X o AO 1 rACTAG AAA 


44 


48.9 


PA APTTA ATX/ixT'/^ A /-« A A A 
v^/^^v^ 1 .X AA X X 0 1 UCAGAAA 


41 


28.2 


PA APAPAnr^ ATTrv^T'r' A A a 
v^.f\/\v^/\^^/v<jljAx ICTGATAlA 


33 


-5.8 


APATTPTPP/'^A Ar^Trwrvry 
-f^vj/i. 1 X o X OOO X AAOTCTCT 


-35 


-3.4 


ATP A A P ATV^^/^ A /~»y^»T>^ A A 

A 1 vjAAo A X CI OG AGGTGAA 


0 


-4.3 


TTTPAPAPTT'/'*TT'i^i^/-vT« A A 

J. 1 ivj/\oAcx xci lOCCTAA 


-5 


-27.7 


AG ATP A PPPTrTT^p A A AT" A 
^vji-i 1 \-*/\v^v^i^ X CC X X AAATA 


15 


72.3 


CAACGGATTTGGTCGTATT 


27 


-2.8 



GAPDH 


NM_002046 


SEQ.IDNO. 0358 


GAPDH 


NM_002046 


SEQ.IDNO. 0359 


GAPDH 


NM_002046 


SEQ.IDNO. 0360 


c-Myc 




SEQ.IDNO. 0361 


c-Myc 




SEQ.IDNO. 0362 


c-Myc 




SEQ.IDNO. 0363 


c-Myc 




SEQ.IDNO. 0364 


MAP2K1 


NM_002755 


SEQ.IDNO. 0365 


MAP2K1 


NMJ002755 


SEQ. ID NO. 0366 


MAP2K1 


NM_002755 


SEQ.IDNO. 0367 


MAP2K1 


]NM_002755 


SEQ.IDNO. 0368 


MAP2K2 


]SIM_030662 


SEQ.IDNO. 0369 


MAP2K2 


NM_030662 


SEQ.IDNO. 0370 


MAP2K2 


NM_030662 


SEQ.IDNO. 0371 


MAP2K2 


NM__030662 


SEQ.IDNO. 0372 


KNSL1(EG5) 


NM_004523 


SEQ.IDNO. 0373 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0374 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0375 


KNSL1(EG5) 


NM_004523 


SEQ, ID NO. 0376 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0377 


CycIophilinA_ 


NM_021I30 


SEQ. ID NO. 0378 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0379 


CycIophilinA_ 


NM_021130 


SEQ.IDNO. 0380 


LAMINA/C 


MM_170707 


SEQ.IDNO. 0381 


LAMINA/C 


NM_1 70707 


SEQ. ID NO. 0382 


LAMINA/C 


NM_1 70707 


SEQ; id NO. 0383 


LAMINA/C 


I^_l 70707 


SEQ.IDNO. 0384 


CyclophilinB 


M60857 


SEQ.IDNO. 0385 


CyclophilinB 


M60857 


SEQ. ID NO. 0386 


CyclophilinB 


M60857 


SEQ.IDNO. 0387 


CyclophilinB 


M60857 


SEQ.IDNO. 0388 


DBII 


NM_020548 


SEQ.IDNO. 0389 


DBI2 


NM_020548 


SEQ.IDNO. 0390 


DBI3 


NM_020548 


SEQ. ID NO. 0391 


DBI4 


NM_020548 


SEQ.IDNO. 0392 


rLUCl 




SEQ.IDNO. 0393 


rLUC2 




SEQ. ID NO. 0394 


rLUC3 




SEQ.IDNO. 0395 


rLUC4 




SEO.IDNO 0396 


SeAPl 


NM__031313 


SEQ.IDNO. 0397 



GAAATCPC ATP Apr A TnTT 


24 


3.9 


GACCTPA APTAP ATr:r:TTT^ 


22 


-22.9 


TGGTTTAP ATfJTTPP A a t a 


9 


9.8 


GAAGA A ATPG ATnTTrsTxr 


31 


-11.7 


AC AP A A A PTTr; A A r» A nr^ A 


22 


51.3 


GGA AG A A ATPn A irnirrnrrT 


18 


26 


GAAAPnAPnArSA APAn'nrr' 


18 


-8.9 


GP AP ATOn ATnn a rznrrr'n^ 


26 


16 


GCAG AG AG AfiP A O A 'Vfvn a 


16 


0.4 


GAGfrTTPTPTrinATY^A a/^t 


14 


15.5 


GAGCAG ATTTG A A GP A A r'T 


14 


18.5 


PAAAGAPGATGAr*Txr*#^ A a 


37 


26.4 


GATPAGPATTTGr'A'mnA a 


24 


-0.7 


TPPAGGAGTTTOTT'a at* a a 


17 


-4.5 


GGA AGPTG ATPr* A rrnrrr* a 


16 


59.2 


GPAGA A ATPTA A nr: AX at a 
x\^x AAOvjA 1 a I a 


53 


35.8 


CAAPA AGGATGA A riTC^ a t 


50 


18.3 


PAGPAGA A ATT^TA A/^/^ ATA 


41 


32.7 


PTAG ATGGPTTTr^r* A r^T A 


39 


3.9 


AGAP A AririTrTT* a a Ar^ a o a 
r\.\j/\\^i\j\KjKj 1 v^v^CAAAvj ACA 


-16 


58.1 


GG A A TTirJP A A n A r^r^ a r^r^ a a 
uo/\Al OLiI^AAvjACCACjCAA 


-6 


36 


AG A ATT'ATTPP' An/^riTTTA 


-3 


16.1 


GPAGAPA Adnimnn A a An \ 

yj^^K3Jxy-'r\r\\J\J X L'L^V^AAAvjA 


8 


8.9 


AG A AGP Arir^TTr* a or^ ATT* A 


31 


38.8 


GAGPTm Ar^T^rrT' Ar^ A a/^a 


33 


22.4 


PPAPPGA AriTTr'Ar^r^r'T A a 
v-'V,^/^v-'V-'ijf/\/\.u 1 1^AL/UV-/1AA 


21 


27.5 


GAGA AG AGPTPn^/^r* ATr* a 


55 


30.1 


GA A AGAnr^ATr^T A/^'i^/^T^ a 


41 


83.9 


GA A AGG ATTT/^r2/"^A/^ A a a 
111 oOv^ 1 AC AAA 


53 


59.1 


APAGP A A ATTrr* ATO/^T/^T 


-20 


28.8 


GGA A AGAPTf^TTrT' A a a a a 
1 VJ A 1 CL/AAAAA 


2 


27 


CAAPA PGPPTO A Tr*r^r^ a 
^'^^^-^^vxvJl./C 1 CA 1 CC i d A 


27 


-7.6 


PATGA A Ar;r*TTAr* ATr« A Ar^ 
1 vj/vAAUC 1 1 AO A 1 C AAC 


25 


-30.8 


AAG ATGPP A Tn AAA rr^TT a 
1 VJV<»I-^ A 1 VJAAAvjC X 1 A 


17 


22 


GCAC ATA PPGPn^4^ a nrrm* 

^ ^-rw^^ X /W^l^VJV-.C 1 VJ AVJ 1 CI 


1 c 

15 


3.9 


G ATP A A ATm^n a a a a r*r^ a 
vj-fv 1 v^/iAA 1 CI vr AAvj AAvjGA 


57 


49.2 


GPPA AGA AryTTT/^r^T'A ata 
vjv^v^/\/vvjAAtj ill CC 1 AATA 


50 


13.7 


CAGC AT ATPTTr^ a Arr'ATx' 
v^rxvjVp^i^i 1 1 IjAACCAI x 


41 


-2.2 


G A A P A A A nt^ AAA /^/^/^ A nn^ a 

u/\AUAAAOuAAACGGATGA 


39 


29.2 


CGGAAACGGTCCAGGCTAT 


6 


26.9 



/J 

SeAP2 NM_031313 SEQ. ID NO. 0398 

SeAP3 NM_031313 SEQ. ID NO. 0399 

SeAP4 NM_031313 SEQ. ID NO. 0400 

fLUCl SEQ. ID NO. 0401 

£LUC2 SEQ. ID NO. 0402 

fLUC3 SEQ. ID NO. 0403 

fLUC4 SEQ. ID NO. 0404 

mCycloJ NM_008907 SEQ. ID NO. 0405 

mCyclo_2 NM_008907 SEQ. ID NO. 0406 

mCyclo_3 NM_008907 SEQ. ID NO. 0407 

inCyclo_4 NM_008907 SEQ. ID NO. 0408 

BCL2_1 NM_000633 SEQ. ED NO. 0409 

BCL2_2 NM_000633 SEQ. ID NO. 0410 

BCL2_3 NM_000633 SEQ. ID NO. 04.11 

BCL2_4 NM_000633 SEQ. ID NO. 04 1 2 

BCL2_5 NM_000633 SEQ. ID NO. 0413 

BCL2_6 NM_000633 SEQ. ID NO. 0414 

QBl NM_003365.I SEQ. ID NO. 0415 

QB2 NM_003365.1 SEQ. ID NO. 0416 

QB3 NM_003365.1 SEQ. ID NO. 0417 

QB4 NM_003365.1 SEQ. ED NO. 0418 

ATEl-1 NM_007041 SEQ. ID NO. 0419 

ATEl-2 NM_007041 SEQ. ID NO. 0420 

ATEl-3 . . NM_007041 SEQ. ID NO. 0421 

ATEl-4 NM_007041 SEQ. ID NO. 0422 

EGFR-1 NM_005228 SEQ. ID NO. 0423 

EGFR-1 NM_005228 SEQ. ID NO. 0424 

EGFR-1 NM_005228 SEQ. ID NO. 0425 

EGFR-1 NM_005228 SEQ. ID NO. 0426 



GCTTCGAGCAGACATGATA 


4 


-11.2 


CCTACACGGTCCTCCTATA 


4 


4.9 


GCCAAGAACCTCATCATCT 


1 


-9.9 


GATATGGGCTGAATACAAA 


54 


40.4 


GCACTCTGATTGACAAATA 


47 


54.7 


TGAAGTCTCTGATTAAGTA 


46 


34.5 


TCAGAGAGATCCTCATAAA 


40 


11.4 


GCAAGAAGATCACCATTTC 


52 


46.4 


GAGAGAAATTTGAGGATGA 


36 


70.7 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


GAAAGAAGGCATGAACATT 


27 


10.3 


GGGAGATAGTGATGAAGTA 


21 


72 


GAAGTACATCCATTATAAG 


1 


3.3 


GTACGACAACCGGGAGATA 


1 


35.9 


AGATAGTGATGAAGTACAT 


-12 


22.1 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


GCACACAGCUUACUACAUC 


52 


-4.8 


GAAAUGCCCUGGUAUCUCA 


49 


22.1 


GAAGGAACGUGAUGUGAUC 


34 


22.9 


GCACUACUCCUGUGUGUGA 


28 


20.4 


GAACCCAGCUGGAGAACUU 


45 


15.5 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


GUACUACGAUCCUGAUUAU 


37 


32.9 


GUGCCGACCUUUACAAUUU 


35 


18.2 


GAAGGAAACTGAATTCAAA 


68 


79.4 


GGAAATATGTACTACGAAA 


49 


49.5 


CCACAAAGCAGTGAATTTA 


41 


7.6 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Example VII. Genome-Wide Application of the Algorithm 

The examples desaibed above demonstrate that the algorithm(s) can 
5 successfully identify functional siRNA and that these duplexes can be used to induce 
the desirable phenotype of transcriptional knockdown or knockout. Each gene or 
family of genes in each organism plays an important role in maintaining physiological 
homeostasis and the algorithm can be used to develop functional, highly functional, or 
hyperfimctional siRNA to each gaie. To accomplish this for the human genome, the 
1 0 entire online ncbi refseq database was accessed through Entrez (efetch). The database 



was processed through Formula VIII. For each gene the top 80 -1 00 scores for 
siKNAs were obtained and BLAST'ed to insure that the selected sequences are 
specific in targeting the gene of choice. These sequences are provided on the 
enclosed CDs in electronic form. Accordingly, Applicants hereby incorporate by 
5 reference the material submitted herewith, in diqplicate on the compact disks labeled 
COPY 1 - TABLES PART, DISK 1/1, TABLES 12-15, Filed with RO/US under 
PCT AI sec. 801(a), Operating System: MS-Windows, COPY 2 - TABLES PART, 
DISK 1/1, TABLES 12 -15, Filed with RO/US under PCT AI sec. 801(a), Operating 
System: MS-Windows, COPY 3 - TABLES PART, DISK 1/1, TABLES 12 -15, 

10 Filed with RO/US under PCT AI sec. 801(a), Operating System: MS-Windows,; 
which copies are identical, in files entitled Table_12.txt, date of creation June 26, 
2003, with a size of 31,045 kb; Table_13.txt, date of creation November 13, 2003, 
with a size of 78,451 kb; Table_14.txt, date of creation November 13, 2003, with a 
size of 454 kb; and Table_15.txt date of creation November 13, 2003, with a size of 

15 1,690 kb. 

With respect to Ihe disks, there are four tables on each disk copy in text 
format: Tables XII -XV. Table XII, which is located in a file entitled Table_12.txt, 
provides a list of the 80-100 sequences for each target, idoitified by Formula VIII as 

20 having the highest relative SMARTscores™ for the target analyzed. Table XIII, 

which is located in a file entitled Table_13.txt, provides tiie SMARTscores™, and for 
each gene, a pool pick of up to four sequences is denoted. (The denotation of"!" in 
Table Xin means that it is a pool pick.) These pool pick sequences represent the most 
fimctional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick 

25 sequences could be used for gene silencing. Further, sequences that are not denoted 
as pool pick sequences, but that are included on the compact disks may also be used 
for gene silencing either alone or in combination with other sequences. However, 
their individual relative functionality would be less than that of a pool pick sequence. 
Table XIV, which is located in a file entitied Table_14.txt, provides an identification 

30 of genes by accession numbar, and Table XV, which is located m a file entitled 
Table l 5.txt, provides a short name for the genes identified on the disk. The 
information contained on the disks is part of this patent application and are 
incorporated into the specification by reference. One may use these tables in order to 
identify functional siRNAs for the gene provided therein, by simply looking for the 



/3 



gene of interest and an siRNA that is listed as functional. Preferably, one would 
select one or more of the siRNA that most optimized for the target of interest and is 
denoted as a pool pick. 

Table XII: siRNA selected by Formula Vni 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

Table XIH: SMARTscores™ 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

Table XIV: Identification of Targets 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

Table XV: Description of Targts 

See data submitted herewiA on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 

Many of the genes to which the described siRNA are directed play critical 
roles in disease etiology. For this reason, the siRNA listed in the accompanying 
compact disk may potentially act as therapeutic agents.. Anumber of prophetic 
examples follow and should be understood in view of the siRNA that are identified on 
the accompanying CD. To isolate these siRNA, the appropriate message sequence for 
each gene is analyzed using one of the before mentioned formulas (preferably formula 
VIII) to identify potential siRNA targets. SubsequenUy these targets are BLAST'ed to 
eliminate homology with potentially off-targets. 

The list of potential disease targets is extensive. For instance, over-expression 
of BcllO has been implicated in the development of MALT lymphoma (mucosa 
associated lymphoid tissue lymphoma) and thus, functional, highly functional, or 
hyperfunctional siRNA directed against that gene (e.g. SEQ. ID NO. 0427: 
GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428: 



to 

GAAAGAACCUUGCCGAUCA; SEQ. ID NO. 0429: 
GGAAAUACAUCAGAGCUUA, or SEQ. ID NO. 0430: 
GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder. 

5 In another example, studies have shown that molecules that inhibit 

glutamine:fructose-6-phosphate aminotransferase (GFA) may act to limit the 
symptoms suffered by Type II diabetics. Thus, functional, highly functional, or 
hyperfunctional siRNA directed against GFA (also known as GFPTl: siRNA = SEQ. 
ID NO. 0433 UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434 
1 0 GAAGUUACCUCUUACAUUU; SEQ. ID NO. 0435 
GUACGAAACUGUAUGAUUA; SEQ. ID NO. 0436 
GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder. 

In another example, the von Hippel-Lindau (VHL) tumor suppressor has been 

1 5 observed to be inactivated at a high frequency in sporadic clear cell renal cell 
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor 
suppressor targets hypoxia-inducible factor-1 alpha (HBF-l alpha), a transcription 
factor that can induce vascular endothelial growth factor (VEGF) expression, for 
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of 

20 HIF-1 alpha, and subsequent VEGF over expression. Such over expression of VEGF 
has been used to explain the increiased (and possibly necessary) vascularity observed 
in RCC. Thus, functional, highly functional, or hyperfunctional siRNAs directed 
against either HIF-1 alpha (SEQ. ID NO. 0437 GAAGGAACCpGAUGCUUUA; 
SEQ. ID NO. 0438 GCAUAUAUCUAGAAGGUAU; SEQ. ID NO. 0439 

25 GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440 

GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441 
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442 
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443 
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444 

30 CGAGGCAGCUUGAGUUAAA) may be useful in the treatment of renal cell 
carcinoma. 

In another example, gene expression of platelet derived growth factor A and B 
(FDGF-A and PDGF-B) has been observed to be increased 22- and 6-fold, 



77 



respectively, in renal tissues taken from patients with diabetic nephropathy as 

compared with controls. These findings suggest that over expression of PDGF A and 

B may play a role in the development of the progressive fibrosis that characterizes 

human diabetic kidney disease. ITius, fimctional, highly fimctional, or hyperfimctional 

siRNAs directed against either PDGF A 

(SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA; 

SEQ. IDNO. 0446: CCGCAAAUAUGCAGAAUUA; 

SEQ. E) NO. 0447: GGAUGUACAUGGCGUGUUA; 

SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or 

PDGFB 

(SEQ. IDNO. 0449: CCGAGGAGCUUUAUGAGAU; 
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU; 
SEQ. ID NO. 0451 GAGCAGGAAUGGUGAGAUG; 
SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU; 
SEQ. ID NO. 0453 CCGAGGAGCUUUAUGAGAU; 

SEQ. ID NO. 0454 UUUAUGAGAUGCUGAGUGA) may be useful in the treatment 
of this form of kidney disordo-. 



hi another example, a strong correlation exists between the over-expression of 
glucose transporters (e.g. GLUT12) and cancer cells. It is predicted that cells 
undergoing uncontrolled cell growth iqvregulate GLUT molecules so that they can 
cope with the heightened energy needs associated with increasedrates^f proliferation 
and metastasis. Thus, siRNA-based therapies that target the molecules such as 
GLUTl (also known as SLC2A1: siRNA= 
SEQ. ID NO.: 0455 GCAAUGAUGUCCAGAAGAA; 
SEQ. ID NO.: 0456 GAAGAAUAUUCAGGACUUA; 
SEQ. IDNO.: 0457 GAAGAGAGUCGGCAGAUGA; 
SEQ. ID NO.: 0458 CCAAGAGUGUGCUAAAGAA) 

GLUT12 (also known as SLCA12: siRNA = 
SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA; 
SEQ. ID NO. 0460: GAAAUGAUGUGGAUAAGAG; 
SEQ. ID NO. 0461: GAUCAAAUCCUCCCUGAAA; 



SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU) and other related transporters, 
may be of value in treating a multitude of malignancies. 

The siRNA sequences listed above are presented in a 5'-^ 3' sense strand 
direction. In addition, siRNA directed against the targets listed above as well as those 
directed against other targets and listed in the accompanying compact disk maybe 
useful as ther^eutic agents. 

Example VHI. Evidence for the Benefits of Pooling 

Evidence for the benefits of pooling have been demonstrated using the 
reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacox, 
proprietary ACE® chemistry against one of the standard reporter genes: firefly 
luciferase. The duplexes were designed to start two base pairs apart and to cover 
approximately 180 base pairs of the luciferase gene (see sequences in Table III) 
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression 
reporter plasmid into HEK293 cells using standard transfection protocols and 
luciferase activity was assayed at 24 and 48 hours. 

Transfection of individual siRNAs showed standard distribution of inhibitory 
effect. Some duplexes were-active, while others were not. Figure 15 represents a 
typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032- 0120) positioned two 
base pairs apart. As the figure suggests, the functionality of the siRNA duplex is 
determined more by a particular sequence of the oligonucleotide than b.y^e, relative 
oligonucleotide position within a gene or excessively sensitive part of the mRNA, 
which is important for traditional anti-sense technology. 

When two continuous oligonucleotides were pooled together, a significant 
increase in gene silencing activity was observed. (See Figure 16) A gradual increase 
m eflScacy and the frequency of pools functionality was observed when the number of 
s>RNAs increased to 3 and 4. (Figures 16, 17). Further, the relative positioning of 
the oligonucleotides within a pool did not detemune whether a particular pool was 
functional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2, 
1 0 and 20 base pairs were functional). 



However, relative positioning may nonetheless have an impact. An increased 
functionality may exist when the siRNA are positioned continuously head to toe (5' 
end of one directly adjacent to the 3' end of the others). 

Additionally, siRNA pools that were tested performed at least as well as the 
best oligonucleotide in the pool, under the experimental conditions whose results are 
depicted in Figure 19. Moreover, when previously identified non-fiinctional and 
marginally (semi) functional siRNA duplexes were pooled together in groups of five 
at a time, a significant functional cooperative action was observed. (See Figure 20) 
In fact, pools of semi-active oligonucleotides were 5 to 25 times more fijnctional than 
the most potent oligonucleotide in the pool. Therefore, pooling several siRNA 
duplexes together does not interfere with the functionaUty of the most potent siRNAs 
within a pool, and pooling provides an unexpected significant increase in overall 
fimctionality 



Example IX. Pooling Across Species 

Experiments were performed on the following genes: P-galactosidase, Renilla 
luciferase, and Secreted alkaline phosphatase, which demonstrates the benefits of 
pooling, (see Figure 21) Approximately 50% of individual siRNAs designed to 
silence the above-specified genes^were functional, while 100% of the pools that 
contain the same siRNA duplexes were fimctional. 

Example X. Highly Functional siRNA 

Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted 
in 9S% fimctional entities (>80% silencing). Pools of siRNAs distributed throughout 
the mRNA that were evenly spaced, covering an approximate 20 - 2000 base pair 
range, were also fimctional. When the pools of siRNA were positioned continuously 
head to tail relative to mRNA sequences and mimicked tiie natural products of Dicer 
cleaved long double stranded RNA, 98% of the pools evidenced hdghly fimctional 
activity (>95% silencing). 



Example XL Human cydophyline 



Table HI above lists the siRNA sequences for the human cyclophylii 
protein. A particularly functional siRNA may be selected by applying these 
sequences to any of Formula I to VII above. 



5 Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a 

kit for silencing a gene. Preferably, within the kit there would be at least one 
sequence that has a relatively high predicted functionality when any of Formulas I - 
VII is applied. 

1 0 Example XII. Sample Pools of siRNAs and Their AppUcation to Human Disease 
The genetic basis behind human disease is well documented and siRNA may 
be used as both research or diagnostic tools and therapeutic agents,- either individually 
or in pools. Genes involved in signal transduction, the immune response, apoptosis, 
DNA repair, cell cycle control, and a variety of other physiological functions have 

15 clinical relevance and therapeutic agents that can modulate expression of these genes 
may alleviate some or all of the associated symptoms. In some instances, these genes 
can be described as a member of a family or class of genes and siRNA (randomly, 
conventionally, or rationally designed) can be directed against one or multiple 
members of the family to induce a desired result. 

20 

To identify rationally designed siRNA to each gene, the sequence was 
analyzed using Formula VIH to identify a SMARTpool containing the fimctional 
sequences. To confirm the activity of these sequences, the siRNA are introduced into 
a cell type of choice (e.g. HeLa ceUs, HEK293 ceUs) and the levels of the appropriate 

25 message are analyzed using one of several art proven techniques. SiRNA having 

heightened levels of potency can be identified by testing each of the before mentioned 
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased 
levels of longevity can be identified by introducing each duplex into cells and testing 
functionality at 24, 48, 72, 96, 120, 144. 168, and 192 hours after transfection. Agents 

30 that induce >95% silencing at sub-nanomolar concentrations and/or induce functional 
levels of silencing for >96 hours are considered hyperfunctional. 



The following are non-limiting examples of families of proteins to which 
siRNA described in this document are targeted against: 



»1 



Transporters, Pumps, and Channels 

Transporters, pumps, and channels represent one class of genes that are 
attractive targets for siRNAs. One major class of transporter molecules are the ATP- 
binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter 
genes have been characterized and have been shown to be involved in a variety of 
physiological functions including transport of bile salts, nucleosides, chloride ions, 
cholesterol, toxins, and more. Predominant among this group are MDRl (which 
encodes the P-glycoprotein, NP_00091 8), the MDR-related proteins (MRPl-7). and 
the breast cancer resistance protein (BCRP). In general, these transporters share a 
common structure, with each protein containing a pair of ATP-binding . domains (also 
known as nucleotide binding folds, NBF) and two sets of transmembrmie (TM) 
domains, each of which typically contains six membrane-spanning a-helices. The 
genes encoding this class of transporter are organized as either foil transporters (U. 
containing two TM and two NBF domains) or as half transporters that assemble as 
either homodimers or heterodimers to create fonctional transporters. As a whole, 
members of the family are widely dispersed throughout the genome and show a high 
degree of amino acid sequence identify among eukaryotes. 

ABC-transporters have been imphcated in several human diseases. For 
instance, molecular efflux pumps of this type play a major role in the development of 
drug resistance exhibited by a variety of cancers and pathogenic microorganisms. In 
the case of human cancers, increased expression of the MDRl gene and related pumps 
have been observed to generate drug resistance to a broad collection of commonly 
used chemothra-apeutics including doxorubicin, daunorubidn, vinblastine, vincristine, 
colchicines. In addition to the contribution these transporters make to the 
development of multi-drug resistance, there are currently 1 3 human genetic diseases 
associated with defects in 14 different transporters. The most common of these 
conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration, 
adrenoleukodystrophy, Tangier disease, Dubin-Johnson syndrome and progressive 
iamihal intrahepatic cholestasis. For this reason, siRNAs directed against members of 
this, and related, famihes are potentially valuable research and therapeutic tools. 



With respect to channels, analysis of Drosophila mutants has enabled the 
initial molecular isolation and characterization of several distinct channels including 
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which 
encodes a voltage activated channel, slowpoke (Slo), a Ca^"^ activated channel, 
5 and ether-a-go-go (Eag). The Eag family is further divided into three subfamilies: 
Eag, Elk (eag-like K channels), and Erg (Eag related genes). 

The Erg subfamily contains three separate family members (Ergl-3) that are 
distanfly related to the sh family of voltage activated channels. Like sh, erg 

1 0 polypetides contain the classic six membrane spanning architecture of K* channels 
(S1-S6) but differ in that each includes a segment associated witfi the C-tenninal 
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD). 
Like many isolated ion channel mutants, erg mutants are temperature-sensitive 
paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in 

1 5 neurons and enhanced transmitter release at the neuromuscular junction. 

Liitial studies on the tissue distribution of all three members of the erg 
subfamily show two general patterns of expression. Ergl and erg3 are broadly 
expressed throughout the nervous system and are observed in the heart, the superior 

20 mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, erg2 
shows a much more restricted pattern of expression and is only observed in ceHac 
ganglia and superior mesenteric gangHa. Similarly, the kinetic properties of the three 
erg potassium channels are not homogeneous. Ergl and erg2 channels are relatively 
slow activating delayed rectifiers whereas the erg3 current activates rapidly and then 

25 exhibits a predominantly transient component that decays to a sustained plateau. The 
current properties of all three channels are sensitive to meflianesulfonanilides, 
suggesting a high degree of conservation in the pore structure of all three proteins. 

Recently, the erg family of channels has been implicated in human disease. 
30 Consistent with the observation that ergl is expressed in the heart, single strand 
conformation polymorphism and DNA sequence analyses have identified HERG 
(human ergl) mutations in six long-QT-syndrome (LQT) families, an inherited 
disorder that results in sudden death firom a ventricular tachyarrythmia. Thus siRNA 
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directed against this group of molecules (e.g. KCNHl-8) will be of extreme 
therapeutic value. 
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Another group of channels that are potential targets of siRNAs are 
the CLCA family that mediate a Ca^^-activated CP conductance in a variety of 
tissues. To date, two bovine (bCLCl; bCLCA2 (Lu-ECAM-1)), three mouse 
(mCLCAl ; mCLCA2; mCLCA3) and four human (hCLCAl ; hCLCA2; hCLCAS; 
hCLCA4) CLCA family members have been isolated and patch-clamp studies with 
transfectedhuman embryonic kidney (HEK-293) cells have shown that bCLCAl, 
mCLCAl, and hCLCAl mediate a Ca^^-activated CF conductance that can be ' 
inhibited by the anion chamiel blocker DIDS and the reducing agent dithiothreitol 
(DTT). 

The protein size, structure, and processing seem to be similar among different 
CLCA family members and has been studied in greatest detail for Lu-ECAM-1 . ITie 
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 130 kDa that is 
processed to a 90-kDa amino-temiinal cleavage product and a group of 30- to 40-kDa 
glycoproteins that are glycosylation variants of a single polypeptide derived from its 
carboxy terminus. Both subunits are associated with the outer cell surface, but only 
the 90-kDa subunit is thought to be anchored to.the cell membrane via four 
transmembrane domains. 
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Although the proteinprocessing and function appear to be conserved among . 
CLCA homologs, significant differences exist in their tissue expression patterns. For 
example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCAl 
is exclusively detected in the trachea, and hCLCAl is selectively expressed in a 
subset of humanintestinal epithelial cells. Thus the emerging pictureis that of a 
multigene family with members that are highly tissue specific, similar to the QC 
family of voltage-gated CP chamiels. The human chamiel, hCLCA2, is particular 
interesting from a medical and phannacological standpoint. CLCA2 is expressed on 
the luminal surface of lung vascular endothelia and serves as an adhesion molecule 
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization. 
Expression of this molecule in normal mammary epithelium is consistently lost in 
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover, 



re^xpression of hCLCA2 in human breast cancer cells abrogates tumorigenicily in 
nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer. For 
these reasons, siRNA directed against CLCA family members and related charaiels 
may prove to be valuable in research and therapeutic venues. 

Transporters Involved in Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft, hi most synapses the signal is 
terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 
terminals. This process is catalyzed by specific neurotransmitter transporter that are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 

Aminobutyric acid (GAB A) is the major inhibitory neurotransmitter in the 
central nervous system. The inhibitory action of GABA, mediated through GABAa/ 
GABAb receptors, and is regulated by GABA transporters (GATs), integral 
membrane proteins located perisynaptically on neurons and gUa. So far four different 
carriei^ (GAT 1 -GAT4) have been cloned and their cellular distribution has been 
partly worked out. Comparative sequence analysis has revealed that GABA 
transporters are related to seyeral other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric acid transporters, monoamine transporters, amino 
acid transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane heUces topology 
and relies upon the Na+/Cl- gradient for transport fiinction. Transport rates are 
dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range, hi 
addition, transporter fimction is bidirectional, and non-vesicular efflux of transmitter 
may contribute to ambient extracellular transmitter levels. 

Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter fimction can be altered by a variety of initiating factors and signal 
transduction cascades. In general, this fimctional regulation occurs in two ways. 
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either by changing the rate of transmitter flux through the transporter or by changing 
the number of functional transporters on the plasma membrane. A recurring theme in 
transporterregulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface, ta general, this fimctional modulation 
occurs in part through activation of second messengers such as kinases, phosphatases, 
arachidonic acid, and pH. However, the mechanisms underlying transporter 
phosphorylation and transporter redistribution have yet to be folly elucidated. 

G ABA transporters play a pathophysiological role in a number of human 
diseases including temporal lobe epilepsy and are the targets of pharmacological 
interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
these reasons, siRNAs directed against members of this family of genes (including but 
not limited to SLCG6AM2) may prove to be valuable research and therapeutic tools 



Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
20 food, dmgs, occupation, and enviromnent. Excretory organs such as kidney, liver, and 
intestine defend the body against the potentially harmfol effects of these compounds 
by transforming them into less active metabolites that are subsequently secreted from 
the system. 
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Carrier-mediated transport of xenobiotics and their metabolites exist for the 
active secretion of organic anions and cations. Both systems are characterized by a 
high clearance opacity and tremendous diversity of substances accepted, properties 
that result from the existance of multiple transporters with overtyping substrate 
specificities. The class of organic anion transporters plays a critical role in the 
elimination of a large number of drugs (e.g., antibiotics, chanotheiapeutics. diuretics, 
nonsteroidal anti-inflammatory drugs, radiocontrast agents, cytostatics); drug 
metabolites (especially conjugation products with glutathione, glucurorride, glycine, 
sulfate, acetate); and toxicants and their metabolites (eg., mycotoxins, herbicides, ' 



plasticizers. glutathione 5-conjugates of polyhaloalkanes, polyhaloalkenes, 
hydroquinones, aminophenols), many of which are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting 
molecules has groAvn tremendously. Uptake of organic anions (OA-) across the " 
basolateral membrane is mediated by the classic sodimn-dependent organic anion 
transport system, which includes a-ketoglutarate (cc-KG^-yOA- exchange via the 
organic anion transporter (OATl) and sodium-ketoglutarate cotransport via the 
NaVdicaiboxylate cotransporter (SDCT2). The organic anion transporting polypetide, 
Oatpl , and the kidney-specific OAT-Kl and OAT.K2 are seen as potential molecules 
that mediate facUitated OA" efflux but could also be involved in reabsoiption via an 
exchange mechanism. Lastly the PEPTl and PEPT2 mediate luminal uptake of 
peptide dmgs, whereas CNTl and CNT2 are involved in reabsorption of nucleosides 

The organic anion-transporting polypeptide 1 (Oatpl) is a Na"^- and ATP- 
independent transporter originally cloned fi^om rat liver. The tissue distribution and 
transport properties of the Oatpl gene product are complex. Oatpl is localized to the 
basolateral membrane of hepatocytes, and is found on the apical membrane of S3 
proximal tubules. Studies with transiently transfected cells (e.g. HeLa ceUs) have 
indicated that Oatpl mediates transport of a variety of molecules including 
taurocholate, estrone-3 -sulfate, aldosterone, Cortisol, and others. The observed uptake 
of taurocholate by Oatpl expressed inX laevis oocytes is accompanied by efflux of 
GSH, suggesting that transport by this molecule may be glutathione dependent. 

Computer modeling suggests that members of the Oatp family are highly 
conserved, hydrophobic, and have 12 transmembrane domains. Decreases in 
expression of Oatp family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical communily. For these reasons, siRNAs directed against 
OAT family members (including but ilot limited to SLC21A2, 3, 6, 8, 9, 11, 12, 14, 
1 5, and related transporters) are potentially useful as research and therapeutic t^ls.' 



Nucleoside transporters. 



Nucleoside transporters play key roles in physiology and phamiacology. 
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 
tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
control the extracellular concentration of adenosine in the vicmity of its cell surface 
receptors and regulate. processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac ^hythmias, and nucleoside 
transport inhibitors such as dipyridamole, dilazep, and draflazine function as coronary 
vasodilators. 



In mammals,plasma membrane transport of nucleosides is brought about by 
membersof the concentrative, Na^-dependent (CNT) and equilibrative, Na^- 
independent (ENT) nucleoside transporter families. CNTs are expressed in a tissue- 
specific fashion; ENTs are present in most, possibly all, cell types and are responsible 
for the movement of hydrophilic nucleosides and nucleoside analogs down their 
concentration gradients, hi addition, structure/function studies of ENT family 
members have predicted these molecules to contain eleven transmembrane helical 
segments with an amino terminus that is intracellular and a carboxyl terminus that is 
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and 
a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated 
the TM 3-6 region as playing a central role in solute recognition. The medical 
importance of the ENT family of proteins is broad, hi humans adenosine exerts a 
range ofxardioprotective effects and inhibitors of ENTs are seen as being valuable in 
alleviating a variety of cardio/cardiovascular ailments, hi addition, responses to 
nucleoside analog drugs has been observed to vary considerably amongst e.g. cancer 
patients. While some forms of drug resistance have been shown to be tied to the up- 
regulation of ABC-transporters {e.g. MDRl), resistance may also be the result of 
reduced dmg uptake {i.e. reduced ENT expression). Thus, a clearer understandmg of 
ENT transporters may aid m optimizing drug treatments for patients suffering a wide 
range of malignancies. For these reasons, siRNAs directed against this class of 
molecules (including SLC28A1-3, SLC29A1.4, and related molecules) may be useful 
as therapeutic and research tools. 



Sulfate Transporters.' 



99 

All cells require inorganic sulfate for normal function. Sulfate is the fourth 
most abundant anion in human plasma and is the major source of sulfur in many 
organisms. Sulfation of extracellular matrix proteins is critical for maintaining 
normal cartilage metabolism and sulfate is an important constituent of myelin 
5 membranes found in the brain 



Because sulfate is a hydrophilic anion that cannot passively cross the lipid 
bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux 
to ensure an optimal supply. To date, a variety of sulfate transporters have been 

1 0 identified in tissues fi-om many origins. These include the renal sulfate transporters 
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate 
transporter (DTDST), the intestinal sulfate transporter (DRA), and the erythrocyte 
anion exchanger (AEl). Most, if not all, of these molecules contain the classic 12 
transmembrane spamiing domain architecture commonly found amongst members of 

15 the anion transporter svq)erfamily. 

Recently three different sulfate transporters have been associated with specific 
human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have 
been recognized as the disease genes mutated in diastrophic dysplasia, congenital 

20 chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively? DTDST is a 

particularly complex disorder. The gene encoding this molecule maps to chromosome 
5q, and encodes two distinct transcripts due to alternative exon usage, hi contrast to 
other sulfate transpOEters (e.g. Sat-1) anion movement by the DTDST protein is 
markedly inMbited by either extracellular chloride or bicarbonate, hnpaired function 

25 of the DTDST gene product leads to undersulfation of proteoglycans and a complex 
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB, 
atelosteogenesis type II, and diastrophic dysplasia) with clinical features including but 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities. 
Interestingly, while epidemiological studies have shown that the disease occurs in 

30 most populations, it is particularly prevalent in Finland owing to an apparent founder 
effect. For these reasons, siRNAs directed against this class of gaies (including but 
not limited to SLC26A1-9, and related molecules) may be potentiaUy helpfijl in both 
therapeutic and research venues. 
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Ion Exchangers 

Intracellular pH regulatory mechanisms are critical for the maintenance of 
countless cellular processes. For instance, in muscle cells, contractile processes and 
metabolic reactions are influenced by pH. Duringperiods of increased energy 
demands and ischemia, muscle cellsproduce large amounts of lactic add that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 

Several different transport mechanisms have evolved to maintain a relatively 
constant intracellular pH. The relative contribution of each of these processes varies 
with cell type, the metabolic requirements of the cell, and the local environmental 
conditions, hitracellular pH regulatory processes that have been characterized 
fimctionally include but are not limited to the NaW exchange, the Na(HC03)„ 
cotransport, and the Na*-dependent and -independent CF/base exchangers. As 
bicarbonate and CO2 comprise the major pH buffer of biological fluids, sodium 
biocarbonate cotransporters (NBCs) are critical. Studies have shown that these 
molecules exist in numerous tissues including the kidney, brain, Uver, cornea, heart, 
and lung, suggesting that NBCs play an important role in mediating HCO3- transport 
in both epithelial as well asnonepithelial cells. 

Recent molecular cloning experiments have identified the existeiice of four 
NBC isoforms (NBCl, 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE 
(Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The 
secondary structuEe::aaal^ses and hydropathy profile of this family predict them to be 
intrinsic membrane proteins with 12 putative transmembrane domains and several 
family members exhibit iV-linked glycosylation sites, protein kinases A and C, casein 
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential 
sites for myristylation and amidation. AE4 is a relatively recent addition to this 
family of proteins and shows between 30-48% homology with the other family 
members. When expressed in COS-7 cells mdXenopm oocytes AE4 exhibits sodium- 
independent and DIDS-insensitive anion exchanger activity. Exchangers have been 
shown to be responsible for a variety of human diseases. For instance, mutations in 
three genes of the anion transporter family (SLC) are believed to cause known 
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3, 
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and 
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goiter/deafoess syndrome (A4, pendrin). Moreover, mutations in Na+/HC03 
transporters have also been associated with various human maladies. For these 
reasons, siRNAs directed against these sorts of genes ie.g. SLC4A4-10, and related 
genes) may be usefol for therapeutic and research purposes. 

Receptors Involved in Synaptic Transmission 

In all vertebrates, fast inhibitory synaptic transmission is the result of the 
interaction between the neurotransmitters glycine (Gly) and y-aminobutyric acid 
(GABA) and their respective receptors. The strychnine-sensitive glycine receptor is 
especially important in that it acts in the mammalian spinal cord and brain st&a and 
has a well-established role in the regulation of locomotor behavior. 

Glycine receptors display significant sequence homology to several other 
receptors including the nicotinic acetylchoUne receptor, the aminobutyric acid 
receptor type A (GABAaR), and the serotonin receptor type 3 (5-HT3R) subunits. As 
members of the superfamily of ligand-gated ion channels, these polypeptides share 
common topological features. The glycine receptor is composed of two types of 
glycosylated integral membrane proteins (al-a4 and P) arranged in a pentameric 
suprastructure. The alpha subunit encodes a large extracellular, N-terminal domain 
that carries the structural determinants essential for agonist and antagonist^inding, 
followed by four transmembrane spanning regions (TM1-TM4), with TM2 playing the 
critical role of forming the inner wall of the chloride channel. 



The density, location, and subunit composition of glycine neurotransmitter 
receptors changes over the course of development. It has been observed that the 
amount of GlyR gene translation (assessed by the injection of developing rat cerebral 
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GABARs 
increases, hi addition, the type and location of mRNAs coding for GlyR changes over 
the course of development. For instance in a study of the expression of alpha 1 and 
alpha 2 subunits in the rat, it was observed that (in embryonic periods El 1-1 8) the 
mantle zone was scarce in the alpha 1 mRNA, but the germmal zone (matrix layer) at 
El 1-14 expressed higher levels of the message. At postnatal day 0 (PO), the alpha 1 
signals became manifested throughout the gray matter of the spinal cord. By contrast. 
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the spinal tissues at PO exhibited the highest levels of alpha 2 mRNA, which 
decreased with the postnatal development. 

In both, man and mouse mutant lines, mutations of GlyR subunit genes result 
in hereditary motor disorders characterized by exaggerated startle responses and 
increased muscle tone. Pathological alleles of the Glral gene are associated with the 
murine phenotypes oscillator (spd"^) and spasmodic (spd). Similarly, a mutant allele 
of Glrb has been found to underly the molecular pathology of the spastic mouse (spa). 
Resembling the situation in the mouse, a variety of GLRAl mutant alleles have been 
shown to be associated with the human neurological disorder hyperekplexia or startle 
disease. For these reasons, siRNA du^ed against glycine receptors (GLRAl -3, 
GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors, 
and related neurotransmitter receptor molecules may be valuable therapeutic and 
research reagents. 

Proteases 

Kallikreins 

One important class of proteases are the kallikreins, serine endopeptidases that 
split peptide substrates preferentially on the C-terminal side of internal arginyl and 
lysyl residues. BCallikreins are generally divided into two distinct groups, plasma 
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of 
enzymes that have substantid^milarities at both the gene and protein level. The 
genes encoding this group are frequently found on a single chromosome, are 
organized in clusters, and are expressed in a broad range of tissues (e.g. pancreas, 
ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single 
geme {e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The 
gene encoding plasma kallikrein is expressed solely in the liver, contains 15 exons, 
and encodes a glycoprotein that is translated as a preprotein called prekallikrein. 

Kallikreins are believed to play an important role in a host of physiological 
events For instance, the immediate consequence of plasma prekallikrein activation is 
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation 
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of 



inflammatory responses. Similarly, plasma kaUikrein promotes single-chain urokinase 
acrivation and subsequent plasminogen activation, events that are critical to blood 
coaggulation and wound healing. 

5 Disruptions in the function of kaUikreins have been implicated in a variety of 

pathological processes including imbalances in renal function and inflammatory 
processes. For these reasons, siRNAs directed against this class of genes (e.g. KLKl- 
1 5) may prove valuable in both research and therapeutic settings. 

10 ADAM Protein.s 

The process of fertilization takes place in a series of discrete steps whereby the 
sperm interacts withri) the cumulus cells and the hyaluronic acid extracellular matrix 
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida 
(ZP), and iii) the egg plasma membrane. During the course of these interactions, the 

1 5 "acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm, 
is induced, allowing the sperm to penetrate the ZP and gain access to the peiivitelline 
space. This process exposes new portions of the sperm membrane, including the inner 
acrosomal membrane and the equatorial segment, regions of the speim head that can 
participate in initial gamete manbrane binding. 

20 

The interactions of the gamete plasma membranes appear to involve multiple 
ligands and receptors and are frequently compared to leukocyte-endothelial 
interactions. These interactions lead- to a series of signal transduction events in the 
egg, known as coUectively as egg activation and include the initiation of oscillations 
25 in intracellular calcium concentration, the exit from mdosis, the entry into the first 
embryonic mitosis, and the formation of a block to polyspermy via the release of ZP- 
modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not 
only adhere to each other but also go on to undergo membrane fiision, making one 
cell (the zygote) from two. 
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Studies on Ifae process of sperm-egg interactions have identified a number of 
proteins that are crucial for fertilization. One class of proteins, caUed the ADAM 
femily (A Disintegrin And Metalloprotease), has been found to be important in 
spermatogenesis and fertilization, as well as various developmental systems including 



myogenesis and neurogenesis. Members of the family contain a disintegrin and 
metalloprotease domain (and therefore have (potentially) both cell adhesion and 
protease activities), as well as cysteme-rich regions, epideimal growth factor (EGF)- 
like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM 
gene family has 29 members and constituents are widely distributed in many tissues 
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 

One of the best-studied members of the ADAM family is fertilin, a 
heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin 
beta. The fertilin beta gene (ADAM2) has been disrupted with a targeting gene 
construct corresponding-to the exon encoding the fertilin beta disintegrin domain. 
Speam from males homozygous for disruptions in this region exhibit defects in 
multiple fecets of sperm function including reduced levels of sjiemi transit from the 
uterus to the oviduct, reduced sperm-ZP bindmg, and reduced sperm-egg binding, all 
of which contribute to male infertility. 

Recently, four new ADAM family members (ADAM 24-27) have bWn 
isolated. The deduced amino acid sequences show that all four contain the complete 
domain organization common to ADAM family members and Northern Blot analysis 
has shown aU four to be specific to the testes. SiRNAs directed against this class of 
genes (e.g. ADAM2 and related proteins) may be useful as research tools and 
therapeutics directed toward fertility . and birth control. 

Aminopep tidases 

Aihinopeptidases are proteases fcat play critical roles in processes such as 
protein maturation, protein digestion in its terminal stage, regulation of hormone 
levels, selective or homeostatic protein turnover, and plasmid stabiUzation. These 
enzymes generally have broad substrate specificity, occur in several forms and play a 
major role in physiological homeostasis. For instance, the effects of bradykinin, 
angiotensin converting enzyme (ACE), and other vasoactive molecules are muted by 
one of several peptidases that cleave the molecule at an intanal position and eliminate 
its ability to bind its cognate receptor (e.g. for bradykinin, the B2-receptor). 



Among the enzymes that can cleave bradykinin is the membrane bound 
aminopeptidase P, also referred to as aminoacylproline aminopeptidase, proline 
aminopeptidase; X-Pro aminopeptidase (eukaryote) and XPNPEP2. Aminopeptidase 
P is an aminoacylproline aminopeptidase specific for NHa-terminal Xaa-proline 
5 bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical 
metal bmding motifs found in other zinc metalloproteases, ii) has an active-site 
configuration similar to that of other members of the MG peptidase family, and iii) is 
present in a variety of tissues including but not limited to the lung, kidney, brain, and 
intestine. 

10 

Aminopeptidases play an important role in a diverse set of human diseases. 
Low plasma concentrations-of aminopeptidase P are a potential predisposing factor 
for development of angio-oedema in patients treated with ACE inhibitors, and 
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of 
1 5 illness including, but not limited to myocardial infarction. For these reasons, siRNAs 
directed against this family of proteins (including but not limited to XPNPEPl and 
related proteins) may be useful as research and therapeutic tools. 

Serine Proteases 

20 • One important class of proteases are the serine proteases. Serine proteases 

share a common catalytic triad of three amino acids in their active site (serine 
(nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either 
esters or peptide bonds utilizing mechaiusms-of covalent catalysis and preferential 
binding of the transition state. Based on the position of their introns serine proteases 

25 have been classified into a minimum of four groups including those m which 1) the 
gene has no introns intarupting the exon coding for the catalytic triad (e.g. the 
haptoglobin gene,); 2) each gene contains an intron just downstream from the codon 
for the histidine residue at the active site, a second intron downstream from the exon 
containing the aspartic acid residue of the active site and a third intron just upstream 

30 from the exon contaming the serine of ttie active site (e.g. trypsinogen, 

chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns 
interrupting the exons coding the catalytic region {eg. complement factor B gene); 
and 4) the genes contain two introns resulting in a large exon that contains both the 



active site aspartatic acid and serine residues (e.g. factor X, factor IX and protein C 
genes). 



a 

over 



Cytotoxic lymphocytes (e.g. CD8(H-) cytotoxic T cells and natural killer cells) 
form tbe major defense of higher organisms against virus-infected and transfonned 
cells. A key function of these cells is to detect and eliminate potentially harmful cells 
by inducing them to undergo apoptosis. This is achieved through two principal 
pathways, both of which require direct but transient contact between the killer cell and 
its target. The first pathway involves ligation of TNF receptor-like molecules such as 
Fas/CD95 to their cognate ligands, and results in mobiUzation of conventional, 
programmed cell-death pathways centered on activation of pro-apoptotic caspases. 
The second mechanism consists-of a pathway whereby the toxic contents of 
specialized class of secretory vesicles are introduced into the target cell. Studies 
the last two decades have identified the toxic components as Granzymes, a family of 
serine proteases that are expressed exclusively by cytotoxic T lymphocytes and 
natural killer (NK) cells. These agents are stored in specialized lytic granules and 
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an 
important role in apoptosis, granzymes can cleave proteins after acidic residues, 
especially aspartic add, and induce apoptosis in the recipient cell. 

Granzymes have been grouped into three subfamilies according to substrate 
specificity. Members of the granzyme family that have enzymatic activity similar to 
the serine protease chymotrypsin are enc0ded.b5La.gene cluster termed the 'chymase 
locus'. Similarly, granzymes with trypsin-like specificities are encoded by the 
•tryptase locus', and a third subfamily cleaves after unbranched hydrophobic residues, 
especiaUy methionine, and are encoded by the "Met-ase locus'. All granzymes are 
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal 
enzymatic activity subsequent to the removal of an amino-teiminal dipeptide. 

Granzymes have been found to be important in a number of important 
biological fimctions including defense against intracellular pathogens, graft versus 
host reactions, the susceptibility to h-ansplantable and spontaneous malignancies, 
lymphoid homeostasis, and the tendency toward auto-immune diseases. For these 



reasons, siRNAs directed against granszymes (e.g. GZMA, GZMB, GZMH, GZHK, 
GZMM) and related serine proteases may be useful research and therapeutic reagent 



Kinases 

Protein Kinases (PKs) have been implicated in a number of biological 
processes. Kinase molecules play a central role in modulating cellular physiology and 
developmental decisions, and have been impUcated in a large Ust of human maladies 
including cancer, diabetes, and others. 



During the course of the last three decades, over a hundred distinct protein 
kinases have been identified, all Avith presumed specific cellular fimctions. A few of 
these enzymes have been isolated to sufficient purity to perform in vitro studies, but 
most remain intractable due to the low abundance of these molecules in the cell. To 
counter this technical difficulty, a number of protein kinases have been isolated by 
molecular cloning strategies that utilize the conserved sequences of the catalytic 
domain to isolate closely related homologs. Alternatively, some kinases have been 
purified (and subsequently studied) based on their interactions with other molecules. 

p58 is a member of the p34cdc2-related supergene family and contains a large 
domain that is highly homologous to the ceU division control kinase, cdc2. This new 
cell division control-related protein kinase was originally identified as a component of 
semipurified galactosyltransferase; thus, it has been denoted galactosyltransferase- 
associated protein kinase (GTA-kinase). GTAJdnaseJias been found to be expressed 
in both adult and embryonic tissues and is known to phosphoiylate a number of 
substrates, including histone HI , and casein. Interestingly enough, over expression of 
this molecule in CHO ceUs has shown that elevated levels of p58 result in a prolonged 
late telophase and an early Gl phase, thus hinting of an important role for GTA- 
kinase in cell cycle regulation. 



Cvclin Dep endent Kina.ses 

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
serine/threonine kinases that mediate many of the ceU cycle transitions that occur 
during dupUcation. Each of these Cdk catalytic subunits associates with a specific 



subset of regulatory subunits, termed cyclins, to produce a distinct Cdk cyclin kinase 
complex that, in general, functions to execute a unique ceU cycle event 

Activation of the Cdk cyclin kinases during cellular transitions is controUed by 
a variety of regulatory mechanisms. For theCdc2-cyclin B complex, inhibition of 
kinase activity during Sphase and G2 is accomplished by phosphorylation of two 
Cdc2 residues, Thr" and Tyr'^ which are positioned within the ATP-binding cleft. 
Phosphorylation of Thr'^ and/or Tyr'^ suppresses the catalytic activity of the molecule 
by dismpting the orientation of the ATP present within this cleft. In contrast, the 
abrupt dephosphorylation of these residues by the Cdc25 phosphatase results in the 
rapid activation of Cdc2 cyclin B kinase activity and subsequent downstream mitotic 
ev«its. While the exact details of this pathway have yet to be elucidated, it has been 
proposed that Thr'^/Tyr»^ phosphorylation functions to permit a cell to attain a critical 
concentration of inactive Cdk cyclin complexes, which, upon activation, induces a 
rapid and complete cell cycle transition. Furthennore, there is evidence in 
mammalian cells that Thr'^/Tyr'^ phosphorylation also functions to delay Cdk 
activation after DNA damage. 

The Schizosaccharomycespombe weel gene product was the firet kinase 
identified that is capable of phosphorylating Tyr'^ in Cdc2. Homologs of the Weel 
kinase have been subsequently identified and biochemically characterized from a wide 
range of species including human, mouse, frog, Sacclmromyces cerevisiae, and 
Drosophila. In vertebrate systems, where Ihr'^'.iitSd^s.also phosphoiylated, the 
Weel kinase was capable ofphosphorylating Cdc2 on Tyr•^ but not Thr•^ indicating 
that another kinase was responsible for Thr"* phosphorylation. This gene, Mytl 
kinase, was recently isolated from the membrane fractions oiXenopus egg extracts 
and^has been shown to be capable ofphosphorylating Thr'' and, to a lessor extent, 
Tyr'5 in Cdc2. A human Mytl homolog displaying similar properties has been 
isolated, as well as a non-membrane-associated molecule with Thr''' kinase activity. 

hi the past decade it has been shown that cancer can originate from 
overexpression of positive regulators, such as cyclins, or from underexpression of 
negative regulators {e,g. pl6 (INK4a), pl5 (INK4b), p21 (Cipl)). Inhibitors such as 
Mytl are the focus of much cancer research because they are capable of controlling 



cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these 
reasons, siRNA directed against kinases and kinase inhibitors including but not 
limited to ABLl, ABL2, ACKl, ALK, AXL, BLK, BMX, BTK, C20orf64, CSFIR, 
SCK, DDRl, DDR2, DKFZp761P1010, EGFR, EPHAl, EPHA2, EPHA3, EPHA4, 
5 EPHA7, EPHA8, EPHBl , EPHB2, EPHB3, EPHB4. EPHB6, ERBB2, ERBB3, 
ERBB4, FER, FES, FGFRl, FGFR2, FGFR3, FGFR4, FGR, FLTl, FLT3, FLT4, 
FRX, FYN, HCK, IGFIR, INSR, ITK, JAKl, JAK2, JAK3, KDR, KIAA1079, KIT, 
LCK, LTK, LYN, MATK, MERTK, MET, MSTIR, MUSK, NTRKl, NTRK2, 
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, P1X9L, RET, 
10 RORl, ROR2, ROSl, RYK, SRC, SYK, TEC, TEK, TIE, TNKl, TXK, TYK2, 
TYR03, YESl , and related proteins, may be useful for research and therapeutic 
purposes. 

G Protein Coupled Receptors 

15 

One important class of genes to which siRNAs can be directed are G-protein 
coupled receptors (GPCRs). GPCKs constitute a superfamily of seven transmembrane 
spanning proteins that respond to a diverse aixay of sensory and chemical stimuli, 
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a 
20 central role in cell proliferation, differentiation, and have been implicated in the 
etiology of disease. 

The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisionecl as a simple linear model: 

25 activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma subunits, both 
of which can activate or inlnbit various downstream effector molecules. More 
specifically, activation of the GPCR induces a confomiational change in the Ga 
subunit, causing GDP to be released and GTP to be bound in its place. The Ga and 

30 GPy subunits then dissociate from the receptor and interact with a variety of effector 
molecules. For instance in the case of the Gs family, the primary function is to 
stimulate the intracellular messenger adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into the secondary messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway and the Gq family activates 



phospholipases C (PLC), which cleaves phosphatidylmositol 4,5, bisphosphate (PIP2) 
to generate inositol- 1,4,5-phosphate (IPS) and diacylglycerol (DAG). 

More recently, studies have shown that the functions of GPCRs are not limited 
to their actions on G-proteins and that considerable cross-talk exists between this 
diveise group of receptor molecules and a second class of membrane bound proteins, 
the receptor tyrosine kinases (RTKs). A number of GPCRs such as endothelin-1, 
thrombin, bombesin, and dopamine receptors can activate MAPKs, a downstream 
effector of the RTK/Ras pathway. Interestingly, the interaction between these two 
families is not unidirectional and RTKs can also modulate the activity of signaling 
pathways traditionally thought to be controUed exclusively by ligands that couple to 
GPCRs. For instance, EGF, which normally activates the MAPK cascade via the 
EGF receptor can stimulate adenylate cyclase activity by activating Gas. 

There are dozens of members of the G Protein-Coupled Receptor family that 
have emerged as prominent drug targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLRl 

CMLl/ CMKLRl (Accession No. Q99788) is a member of the chemokine 
receptor family of GPCRs that may play a role in a number of diseases including 
those involved in inflammation and immunological responses (e.g. asthma, arthritis). 
For this reason, siRNA directed against this protein may^oye4o-be important 
therapeutic reagents. 



Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy that is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being implicated in JNCL, 
CLN3 (GPCR-like protein. Accession No..A57219) expression studies have shown 
that the CLN3 mRNA and protein are hi^y over-expressed in a number of cancers 
ie.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaiies, breast, 
and colon) suggesting a possible contribution of this gene to tumor growth. For this 



reason, siRNA directed against this protein may prove to be important therapeutic 
reagents. 



CLACR 



The calcitonin receptor (CTR/ CALCR, Accession No. NM_001742) belongs 
to "family B" of GPCRs which typically recognized regulatory peptides such as 
parathyroid honnone, secretin, glucagons and vasoactive intestinal polypeptide 
Although the CT receptor typically binds to calcitonin (CT), a 32 amino add peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
(Receptor Activity Modulating Protein) enables it to readily bind other members of 
the calcitonin peptide family including amyUn (AMY) and other CT gene-related 
peptides(e.^. aCGRP and PCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast mediated bone resorption and enhanced Ca^^ 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 
important role in a variety of processes as wide ranging as embryonic/foetal 
development and sperm function/physiology. In addition, studies haVe shown that 

patients with particular CTR genotypes may be at higher risk to lose bone mass and 
that this GPCR may contribute to the formation of calcimn oxalate urinary stones For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents 



OXTR 



The human oxytocin receptor (OTR, OXTR) is a 389 amino acid polypq,tide 
that exhibits the seven transmembrane domain structure and belongs-to the Class-I 
(rhodopsin-type) family of G.prx>tein coupled receptor. OTR is expressed in a wide 
vanety of tissues throughout development and mediates physiological changes 
through G(q) proteins and phospholipase C-beta. Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad list of duties. OT and OTR play a 
role in a host of sexual, maternal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning. In addition, it has been 
hypothesized that abnormalities in the functionality of oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities inchiding compulsive bdiavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 



lUl 

diseases. For these reasons, siRNA directed against this gene (NM_00091 6) may play 
an important role in combating OTR-associated illnesses. 



EDG GPCR.S 

Lysophosphatidic acid and other lipid-based hormones/growth factors induce 
their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a number of human 
diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropatliy (IgAN), researchers have observed an 
enhanced expression of EDG5 (NP_004221) suggesting a contribution of this gene 
product in the development of IgAN. For that reasons, siRNA directed against Edg5 
(NM_004230), Edg4 (NM_004720), Edg7 (Nm_012152) and related genes may play 
an important role in combating human disease. 

Genes Involved in Cholesterol Sienalinp smd Bio.svnthpsis: 

Studies on model genetic organisms such as Drosophila and C. elegans have 
led to the identification of a plethora of genes that are essential for early development 
Mutational analysis and ectopic expression shidies have allowed many of these genes 
to be grouped into discreet signal toransduction pathways and have shown that these 
elemaits play critical roles in pattern formation and cell differentiation. Disruption of 
one or more of tiiese genes during early stages of development frequentiy leads to 
birth defects whereas as alteration of gene function at later stages in life can result in 
tumorigenesis. _ _^ ^ 

One critical set of interactions known to exist in both invertebrates and 
vertebrates is the Sonic Hedgehog-Patched-GU pathway. OriginaUy documented as a 
Drosophila segmentation mutant, several labs have recently identified human and 
mouse orthologs of many of flie pathways members and have successfully related 
disruptions in these genes to known diseases. Pathway activation is initiated with the 
secretion of Sonic hedgehog. There are three closely related members of tiie Shh 
family (Sonic hedgehog. Desert, and Indian) with Shh being the most widely 
expressed form of the group. ITie Shh gene product is secreted as a small pro-signal 
molecule. To successfully initiate its developmental role, Shh is first cleaved, 
whereupon the N-tenninal tiimcated fragment is covalently modified with cholesterol. 



lUZ 



The addition of the sterol moiety promotes the interaction between Shh and its 
cognate membrane bound receptor. Patched (Ptch). There are at least two isoforms of 
the Patched gene, Ptchl and Ptch2. Both isoforms contain a sterol-sensing domain 
(SSD); a roughly 180 amino acid cluster that is found in at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
traffic, signal transduction, cholesterol transport, and sterol homeostasis, hi the 
absence of Shh, the Patched protein is a negative regulator of the pathway. In contrast, 
binding of Shh-cholesterol to the Patched receptor releases the negative inhibition 
which that molecule enforces on a G-protein coupled receptor known as Smoothened, 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a trio of transcription factors that belong to the GK family. All three fectors are 
relatively large, contain a characteristic C2.H2 zinc-fingerpfentamer. and recognize 
one of two consensus sequences (SEQ. ID NO. 0463 GACCACCCA or SEQ. ID NO. 
0464 GAACCACCCA). In the absence of Shh, GH proteins are cleaved by the 
proteosome and the C-terminally truncated fragment translocates to the nucleus and 
acts as a dominant transcription repressor, hi the presence of Shh-cholesterol, Gli 
repressor formation is inhibited and fuU-length Gli functions as a transcriptional 
activator. 



Shh and other members of the Shh-PTCH-Gli pathway are expressed in a 
broad range of tissues (e.g. the notochord, die flooiplate of the neural tube, the brain, 
and the gut) at early stages in development. Not surprisingly, mutations tiiat lead to 
altered protein expression or function have been shown to induce developmental 
abnormahties. Defects in tiie human Shh gene have been shown to cause 
holoprosencephaly, a midline defect tiiat manifests itself as cleft Kp or palate, CNS 
septation, and a wide range of oth«: ph«iotypes. Interestingly, defects in chJlesteix>l 
biosynthesis generate similar Shh-Uke disorders (e.g. Smitii-Lemli-Opitz syndrome) 
suggesting that diolesterol modification of the Shh gene product is crucial for 
patiiway function. Boti, the Patched and Smootiiened genes have also been shown to 
be clinically relevant witii Smootiiened now being recognized as an oncogene tiiat, 
like PTCH-1 and PTCH-2, is believed to be tiie causative agent of sev^al forms of 
adult himors. For tiiese reasons, siRNA directed against Smootiiened (SMO. 
NM_005631), Patched (PTCH, mn_000264), and additional genes tiiat participate in 



cholesterol signaling, biosynfliesis, and degradation, have potentially useful research 
and therapeutic applications. 

Targeted Pathways . 

) In addition to targeting siRNA against one or more members of a family of 

proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against members of a signal transduction pathway (e.g. the 
insulin pathway, including AKTl-3, CBL, CBLB, EIF4EBP1, FOXOl A, F0X03A, 
FRAPl, GSK3A, GSK3B, IGFl, IGFIR, INPP5P, INSR, IRSl, MLLT7, PDPKl, 
0 PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KA1, 

RPX6KA3, SGK, TSCl, TSC2, AND XPOl), an apoptotic pathway (CASP3,6,7,8,9, 
DSHl/2, PllO, P85, PDKl/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
BCL2, SMAC, and others), pathways, involved in DNA damage, cell cycle, and otha: 
physiological (p53,MDM2, CHKl/2, BRCAl/2, ATM, ATR, P15INK4, P27, P21, 
5 SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, Inos, Mtor, FKBP, PPAR, 
RXR, ER). Similarly, genes involved in immune system function including TNFRl, 
IL-IR, IRAKl/2, TRAF2, TRAF6, TRADD, FADD, IKKe, IKKy, IKKp, IKKa, 
IkBa, IkBp, p50, p65, Rac, RhoA, Cdc42, ROCK, Pakl/2/3/4/5/6, cIAP, HDACl/2, 
CBP, p-TrCP, Rip2/4, and others are also important targets for the siRNAs desraibed 
20 in this documait and may be li^ftil in treating immune system disorders. Genes 

involved in apoptosis, such as Dshl/2,PTEN, PllO (pan), P85, PDKl/2, Aktl, Akt2, 
Akt (pan), p70^^^, GSK3P, PP2A (cat), p-catenin, HSP90, Cdc37/p50, P23, Bad, 
BcbtL, Bcl2, Smac/Diablo, and Askl are potentially usefid in the treaffirMt of '^'^ 
diseases that involve defects in progranuned cell death (e.g. cancer), while siRNA 
25 agents directed against p53, MDM2, Chkl/2, BRCAl/2, ATM, ATR, pi P27, 
P21, Skp2, Cdc25C/A, 14-3-3a/e, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 
PPARy, RXRa, ERa and related genes may play a critical role in combating diseases 
associated with disruptions in DNA repair, and cell cycle abnormalities. 

30 Tables VI -Table X below provide examples of useful pools for inhibiting 

different genes in the human insulin pathway and tyrosine kinase pathways, proteins 
involved in the cell cycle, the production of nuclear receptors, and other genes. These 
particular pools are particularly useful in humans, but would be useful in any species 



that generates an a^ropriately homologous mRNA. Futther, wi,M„ each of the Usted 
pools any one seq„e«=e maybe used indq,ende„„y bu, p,efemb.y at leas, two of the 
hsted seqt^noes. more preferably at least three, and most preferably all of the listed 
sequences for a given gene is present. 



Table VI 

Gene 
Name 
AKT1 
AKT1 



AKri 



AKT1 



Acc# 

NM 005163" 
NM 005163 



NM 005163 



Gl 

14885060 
4885060 
14885060 



AKT2 
AKT2 
AKT2 
AKT2 
AKT2 
AKT3 
AKT3 
AKT3 



AKT3 

AKT3 

CBL 

CBL 

CBL 

CBL 



CBL 

CBLB 

CBLB 



CBLB 
CBLB 



CBLB 
EIF4EBP1 
EIF4EBP1 



EIF4EBP1 



EIF4EBP1 



EIF4EBP 1 
F0X01A 



F0X01A 



F0X01A 



F0X01A 
FOX01A 
F0X03A 
F0X03A 
F0X03A 
FOX03A 
F0X03A 



FRAP1 
-RAP1 
-RAP1 
FRAP1 
FRAP1 



NM 005163 Uftftfinfin 



NM. 001626 
NM 001626 
NM 001626 
NM 001626 

NM 005465 
NM 005465 



NM 005465 
NM 005465 

NM 005188 
NM 005188 
NM 005188 



6715585 
6715585 
6715585 
6715585 

32307164 
32307164 
32307164 
32307164 

4885116 
4885116 
14885116 



NM 005188 4885116 
NM 170662 2fl.-H6fiftn7 



NM_.1 70662 
NM_ 170662 



NM 170662 



29366807 
29366807 
129366807 



NM 004095 ?nn7ni7Q 



NM 004095 700701 7Q 



LL. 

207 
207 



I Duplex* 

p-003000-05 
P-003000-06 
|D-003000-07 



207|D-003000-08 



208 
208 
208 
208 



Sequence 



GACAAGGACGG6CACATTA 
GGACAAGGACGGGCACATf 
GCTACTTCCTCCTr A A^ /^ 



GACCGCC TCTGClTrrm nJ^ 



U-003001-05 1 G l ACTTCGATfiATHAZ^ - 
GCAAAGAGGGCATCAGTGA 
GGGCTAAAGTGACCATGAA 
GCAGAATGCCAGCTGATGA 



p-003001-06 
p-003001-07 
lD-003001-08 



10000 [ D-003Q0?-nR 
10000 0-003002-06 
10000 p-003002-07 
10000 |d-003002-08 



867 
867 
867 



p-003003-05 
p-003003-06 
ID-003003-07 



867 p-003QQa-nft 



868 
868 
868 
868 



ID-003004-05 



p-003004-06 
p-003004-07 
ID-003004-08 



1978 p-003005-nfi 



NM 004095 



NM 004095 



20070179 
[20070179 



NM 002015 



NM 002015 



M 002015 
M 002015 

M_001455" 
M 001455 
M 001455 
NM 001455 

\4_004958~ 
\/l_004958 
Vl_004958 
M 004958 



9257221 
19257221 



1978 p-00300R-nfi 



1978 
1978 



p-003005-07 
I D-003005-08 



9257221 
19257221 

4503738 
4503738 
4503738 
4503738 

119924298 
19924298 
19924298 

1 19924298 



2308 p-oo3nnR-nf^ 

2308 IP-OOSOOR-nR 



2308 p -0030nfi-n7 
23U8 p-003006-08 

2309p-003007-01 
2309 0-003007-02 
2309 0-003007-03 
2309|P-003007 -04 

2475 jD-003008-05 

2475 P-003008-06 

2475 I P-003008 -07 
2475 



GGAGTAAACTGGCAAGATfi 
GACATTAAATTTCCTCGAA 
GACCAAAGCCAAACACATT 
GAGGAGAGAATGAATTGTA 

ggagacacatttcggatta" 

GATCTGACCTGCAATGATT 
GACAATCCCTCAnAATAAA 



CCAGAAAGCTTTGGTCATT 
GACCATACCTCATAfl nAA/^- 



TGAAAGACCTCCACCAATC 

gatgaaggctccaggtgtt 
atcagcatttac6actta 

gcaatagcccagaagataa" 



CGCAATAGCCCAPiAAriATA 



GAGATGGACATTTAAAGCA 
CAATAGCCCARAAftATA^rv 



CCAGGCATCTCATAACAAA 
CCAGATGCCTATACAAACA 



GGAGGTATGAGTCAGTATA 
GAGGTATGAGTCAGTATAA 

CAATAGCAACAAGTATACC 
TGAAGTCCAGGACGATGAT 
TGTCACACTATGGTAACCA 
TGTTCAATGGGAGCTTGGA 

GAGAAGAAATGGAAGAAAT^ 

CCAAAGTGCTGCAGTACTA 

- — |GAc jCAT6CCGTC AATA/^TA 

D-003UU8-U8 1 GG rCTGAACTCAATY^lAgA^ 



SEQ. 
ID NO 



465 
466 
467 



468 

469 
470 
471 
472 

473 
474 
475 
476 

477 
478 
479 



480 



481 



482 
483 
484 

485 



486 



487 
488 



489 
490 



493 
494 
495 



496 



497 
498 
499 
500 



1U5 



|GSK3A j 














GSK3A h 


JM 01yoo4 1 


1 yy 0^ f 0 


\70 1 L 


)-003009-05 C 


5GACAAAGGTGTTCAAATC 


501 


GSK3A r 




1 yyo^ 1 0 £. 


J?0 1 L. 


)-003009-06 C 


5AACCCAGCTGCCTAACAA 


502 


GSK3A ^ 




1 yy o^f / 0 z 




)-003009-07 C 


5GGCACAGCTTCTTTGATG 


503 


GSK3A Y 


mil f\Ar%ooA Ia 

sIM u19oo4 r 


1 yy D*t #0 ^! 


1^0 1 L 




3CTCTAGCCTGCTGGAGTA 


504 


GSK3B 














GSK3B r 




1 1 OD I OOzf I 




3-003010-05 ( 


3AAGAAAGATGAGGTCTAT 


505 


GSK3B f 


IK A rVAOOAO I'' 

SlM 004uyo ii 


: 1 OD 1 ooy I 


1n70^ 


D-0030 10-06 ( 


3GACCCAAATGTCAAAGTA 


506 


GSK3B U 


sjM uozuyo L 


1 1 00 1 ooy i 




VOO'^010-07 ( 


3AAATGAACCCAAACTACA 


507 


GSK3B t 


SlM oo^oyo |i 


i. 1 OD 1 ooy i 


1^0^ 




3ATGAGGTCTATCTTAATC 


508 


jiGFI 1 














|IGF1 |l 


nM uOUdio 






0-003011-05 ( 


3GAAGTACATTTGAAGAAC 


509 


IGF1 1 


nM OOOdIo I 








^GAAGGAAGTACATTTGAA 


510 


|lGF1 II 


MM OOU0I0 1 






D-003011-07 


CCTCAAGCCTGCCAAGTCA 


511 


llGFI 1 


nM 000b 1o 








GGTGGATGCTCTTCAGTTC 


512 


IGF1R 














IIGFIR 1 


MM_000of 5 


1 lUDoUU^ 


OH-OU 


n-OO'^0 12-05 


CAACGAAGCTTCTGTGATG 


513 


IGF1R 


MM 000o7o 


1 lUDoUU^ 


OHOU 




GGCCAGAAATGGAGAATAA 


514 


IGF1R 


NM 000o7o 


1 1 UDOUU^ 


o*fou 


D-OO'^0 12-07 


GAAGCAGCGTTTAAGAATG 


515 


IIGFIR 1 


NM_00Uo/D 


1 lUDOUU^ 




n-00301 2-08 


GCAGACACCTACAACATCA 


516 


INPP5D 














INPP5D 


NM 005541 


ouo 1 /yo 


ODOO 


D-On'^OI'^-OB 


GGAATTGCGTTTACACTTA 


517 


INPP5D 


NM 005541 


CAQ4 70Q 

oOol /yo 


ODOO 


U— vWOv lO^lV 


GGAAACTGATCATTAAGAA 


518 


1NPP5D 


NM 005541 


oool /yo 


0000 




CGACAGGGATGAAGTACAA 


519 


INPP5D 


NM 005541 


ouo 1 f yo 


ODOO 




AAACGCAGCTGCCCATCTA 


520 


INSR 














INSR 


NM 000208 


4oo/c5oO 


ODHO 




GGAAGACGTTTGAGGATTA 


521 


INSR 


NM OOO2O0 


400 / ooO 


OD*fO 


H-On'^0 14-06 


GAACAAGGCTCCCGAGAGT 


522 


INSR 


NM OOO2O0 


400/ ooO 


ODHO 


H-On'^OI 4-07 


GGAGAGACCTTGGAAAl "IG 


523 


INSR 


NM OOO2O0 


4t30f OOP 


0D*fO 


n-fifi'^niA-nR 


GGACGGAACCCACCTATTT 


524 


IRS1 














IRS1 


NM 005544 


OOol O04 


ODDf 


V/VJvJw 1 "J \J\J 


AAAGAGGTCTGGCAAGTGA 


525 


iRS1 


NM _ 005544 


OUol oU4 


ODO-/^ 


0-00*^01 ^-06 


GAACCTGATTGGTATGTAC 


526 




IRS1 


]kIKA f\f\CCAA 

NM 005544 


1 oUOl oU4 


ODD / 


n-no'^oi s-07 


GCACGGGGATCTAGTGCTT 


527 




IRS1 


IkIKA f\f\CCAA 

NM 005544 


1 OOol oU4 


ODD/ 


0-00*^01 'WIR 


GTCAGTGTGTCGTGCAGTA 


528 




MLLT7 
















MLLT7 


NM 005938 


1 01 /40/ 0 


A'^Ci'^ 
*tOUO 


D-OO'^0 16-05 


GGACTGGACTTCAACTTTG 


529 




MLLT7 


NM 005908 


lDl/4DiO 


*tOUO 




CCACGAAGCAGTTCAAAXG 


530 




MLLT7 


I KIR A nr\cf\00 

NM 00o9o8 


1 51 /4Df 0 


'fOUO 


H-OO'^OI 6-07 


GAGAAGCGACTGACACTTG 


531 




MLLT7 


IkIRA /\f\CAOO 

NM 00o9oo 


1 Dl /4D/0 


A'^'\ 

*fOUO 


0-00^^016-08 


GACCAGAGATCGCTAACCA 


532 




PDPK1 
















PDPK1 


Ik IRA nn'^GAO 

NM 00261 0 


I 40U0Dy^ 




n-00'H017-05 


CAAGAGACCTCGTGGAGAA 


533 




PDPK1 


Ik IRA /\/^0£>'10 

NM 002613 


4000Dy4 


m7n 


0-00^^017-06 

L/^JvlvJv 1 1 \J\J 


GACCAGAGGCGAAGAA 1 \ \ 


534 




PDPK1 


IkIAA AnO£5'IO 

NM OO20I0 


1 40U0DyH' 


m7n 

0 1 # V 


n-00^01 7-07 


GGAAACGAGTATCTTATAT 


535 




PDPK1 


NM 002010 


1 *>-DU0Dy4 


m7n 


n-003017-Ofl 


► GAGAAGCGACATATCATAA 


536 




PIK3CA 
















[PIK3CA 


1 Kill A nnciOi Q 
NM UUdZiO 


\ OHOOoy 1 




D-003018-0E 


\ GCTATCATCTGAAGAATTA 


537 




PIK3CA 


IKIRA AACJO-I 0 

NM uObzIo 


\ DHpooy 1 




n-ooaoi8-of 


> GGATAGAGGCCAAATAATA 


538 




PIK3CA 


IkiRA AAf^OH 0 

NM 006218 


o4oooyi 




0-00*^0 18-0*3 


^ GGACAACTGTTTCATATAG 


53£ 




PIK3CA 


NM OObZIo 


1 O^DOOl? 1 




n-003018-Of 


GCCAGTACCTCATGG A 1 1 A 


54C 




PIK3CB 
















P1K3CB 


NM 006219 


1 o4Dooyo 




D-OO'^OIQ-O' 


5 CGACAAGACTGCCGAGAGA 


541 




PIK3CB 


NM 006219 


1 o4Dooyo 




D-003019-0( 


5 TCAAGTGTCTCCTAATATG 


54: 




lpiK3CB 


|NM 006219 


1 o4Dooyo 




D-003019-Q' 


7 GGATTCAGTTGGAGTGAI 1 


54: 




[piK3CB 


NM 006219 


5453893 


5291 


D-003019-01 


E TTTCAAGTGTCTCCTAATA 


54^ 




[PIK3R1 















PIK3R1 


NM_181504 


32455251 


5295 


D-003020-05 


GGAAATATGGCTTCTCTGA 


545 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-06 


GAAAGACGAGAGACCAATA 


546 


PIK3R1 


NM_181504 


32455251 


5295 


D-003020-07 


GTAAAGCATTGTGTCATAA 


547 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-08 


GGATCAAGTTGTCAAAGAA 


548 


PIK3R2 














PIK3R2 


NM 005027 


4826907 


5296 


D-003021-05 


GGAAAGGCGGGAACAATAA 


548 


PIK3R2 


NM_005027 


4826907 


5296 


D-003021-06 


GATGAAGCGTACTGCAATT 


550 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-07 


GGACAGCGAATCTCACTAC 


551 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-08 


GCAAGATCCGAGACXJAGTA 


552 


PPP2R2B 














PPP2R2B 


NM 004576 


4758953 


5521 


D-003G22-05 


GAATGCAGCTTACTTTCTT 


552 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-06 


GACCGAAGCTGACATTATC 


554 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-07 


TCGATTACCTGAAGAGTTT 


55£ 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-08 


CCTGAAGAGTTTAGAAATA 


556 


PTEN 














PTEN 


NM 000314 


4506248 


5728 


D-003023-05 


GTGAAGATCTTGACCAATG 


557 


PTEN 


NM 000314 


4506248 


5728 


D-003023-06 


GATCAGCATACACAAATTA 


558 


PTEN 


NM 000314 


4506248 


5728 


D-003023-07 


GGCGCTATGTGTATTATTA 


55£ 


PTEN 


NM 000314 


4506248 


5728 


D-003023-08 


GTATAGAGCGTGCAGATAA 


56C 


RPS6 














RPS6 


NM 001010 


17158043 


6194 


D-003024-05 


GCCAGAAACTCATTGAAGT 


561 


RPS6 


NM 001010 


17158043 


6194 


D-003024-06 


GGATATTCCTGGACTGACT 


565 


RPS6 


NM 001010 


17158043 


6194 


D-003024-07 


CCAAGGAGAACTGGAGAAA 


56C- 


RPS6 


NM 001010 


17168043 


6194 


D-003024-08 


GCGTATGGCCACAGAAGTT 


564 


RPS6KA1 














RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-05 


GATGACAGCTTCTACTTTG 


56£ 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-06 


GAGAATGGGCTCCTCATGA 


56€ 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-07 


CAAGCGGGATCCTTCAGAA 


567 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-08 


CCACCGGCCTGATGGAAGA 


56£ 


RPS6KA3 














RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-05 


GAAGGGAAGTTGTATCTTA 


56C 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-06 


GAAAGTATGTGTATGTAGT 


57C 


RPS6KA3 


NM 004586 


4759049 


6197- - 


C)-003026-07 


GGACAGCATCCAAACATTA 


571 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-08 


GGAGGTGAATTGCTGGATA 


575 


SGK 














SGK 


NM 005627 


5032090 


6446 


D-003027-01 


TTAATGGTGGAGAGTTGTT 


57c 


SGK 


NM 005627 


5032090 


6446 ■ 


D-003027-04 


ATTAACTGGGATGATCTCA 


574 


SGK 


NM 005627 


25168262 


6446 


D-003027-05 


GAAGAAAGCAATCCTGAAA . 


_57e 


SGK 


NM 005627 


25168262 


6446 


D-003027-06 


AAAGACAGCTGAAATGTAC - 


57f 


TSC1 














TSC1 


NM 000368 


24475626 


7248 


D-003028-05 


GAAGATGGCTATTCTGTGT 


577 


TSC1 


NM 000368 


24475626 


7248 


D-003028-06 


TATGAAGGCTCGAGAGTTA 


57£ 


TSC1 


NM 000368 


24475626 


7248 


D-003028-07 


CGACACGGCTGATAACTGA 


57£ 


TSC1 


NM 000368 


24475626 


7248 


D-003028-08 


CGGCTGATGTTGTTAAATA 


58C 


TSC2 














TSC2 


NM_000548 


10938006 


7249 


D-003029-05 


GCATTAATCTCTTACCATA 


581 


TSC2 


NM 000548 


10938006 


7249 


D-003029-06 


CGAATGTCCTCTTGTCTTT 


585 


TSC2 


NM 000548 


10938006 


7249 


D-003029-07 


GGAGACACATCACCTACTT 


58c 


TSC2 


NM 000548 


10938006 


7249 


D-003029-08 


TCACCAGGCTCATCAAGAA 


58^ 


XP01 














XP01 


NM 003400 


8051634 


7514 


D-003030-05 


GAAAGTCTCTGTCAAAATA 


58f 


XP01 


NM 003400 


8051634 


7514 


D-003030-06 


GCAATAGGCTCCATTAGTG 


58f 


ypni 




8051634 


7514 


D-003030-07 


GGAACATGATCAACTTATA 


587 


XP01 


NM 003400 


8051634 


7514 


D-003030-08 


GGATACAGATTCCATAAAT 


58{ 



1U7 



TabJe VII 



Gene 
Name 


Acc# 


Gl 


LL 


Duplex # 


SpntipnPA 

wv;^V|u^l IOC? 


or 

SE 
ID 


ABL1 














ABL1 


NM 007313 


6382057 


25 


D-0031 00-05 


G G AAATC Af^ Tn A P AT A r^Tr^ 


rr 

5 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-06 


G GTCC AC APTf^P A ATriiTTx 


5 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-07 


GAAGGAAATHAnTriAPATA 


c 
o 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-08 


TCACTGAGTTPATCArPTA 


c 

o 


ABL2 














ABL2 


NM 007314 


6382061 


27 


D-0031 01 -05 


GAAATGGAfnPRAAPAriATA 


o 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -06 


GAGCCAAATTTPPTATTAA 


D 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -07 


GTAATA AC5PPT A P A rt5TPT A 


c 
O 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -08 


GGAGTC5 A AriTTPriPTPTA a 


c 
O 


ACK1 














ACK1 


NM 005781 


8922074 


10188 


D-0031 02-05 


A A Ann P A A r^TPn Trtri ATn a 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-06 


GCAAGTPr^Tr^r^ ATf:: Ar^TA a 


o 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-07 


GAGCAPTAPPTPAri A ATr* a 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-08 


TCAGPARPAPPPAPTATTA 


D 


ALK 














ALK 


NM 004304 


29029631 


238 


D-0031 03-05 


GAPAAf^ATPPTf^p Ar: A AT A 

vj/iv.»r\MVj/A 1 Ov> 1 vjLrMvjMM i M 


D 


ALK 


NM 004304 


29029631 


238 


D-0031 03-06 


GfnAAf^AriTPTPrir* Ar^Txr* a 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-07 


\J\^f^\^\D 1 OOV^ 1 OooVjAOA 1 1 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-08 


G A APTf^ P A rxT^i A A r* A A A 
V3rv-\0 i vJV^Mo 1 VJMMijVjAAvjrA 


6 


AXL 














AXL 


NM 021913 


21536465 


558 


D-0031 04-05 


n riTP A A PT/^: A r» A TTT 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-06 


(n A A A A A A 1^ A Oi^/^#^TTr A 

OAAMMoMMO^AVjAwLrOo 1 1 A 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-07 


P P A A n A A ATPT A A A Tr* 
v^v-^rv-\UMMv3M 1 o 1 AL»AA 1 ovj 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-08 


fnf^ A APTf^P ATnr^Tr* a A~m a 


6 


BLK 














BLK 


NM 001715 


4502412 


640 


D-0031 05-05 


GAGGATGCCTGCTGGATTT 


6 


Rl If 
DLI\ 


NM 001715 


4502412 


640 


D-4)031 05-06 


ACATGAAGGTGGCCATTAA 


6 


BLK 


NM 001715 


4502412 


640 


D-0031 05-07 


fif^TPAAPrjpppAA/^ Ar^A Ar* 
i vAAvJOv?Ov/L>AAVjA\^AAij 


6 


BLK 


NM 001715 


4502412 


640 


D-0031 05-08 


GAAAPTPf^rsf^TPTi^ri Ar* A A 


o 


BMX 














BMX 
..BMX 


NM 001721 
NM 001721 


21359831 
21359831 


660 
660 


D-0031 06-05 


A A AP A A APPTTTPT^T A r*T A 

/A/nA\V>MMMOO 1 1 1 L/Lr 1 AL/ 1 A 


0 


BMX 


NM 001721 


21359831 


660 


D-0031 06-06 
D-0031 06-07 


G A A nn A n P ATTT ATr^ nTT A 
GAnAAnAPATTAPPTTnTT 


-& 

o 


BMX 


NM 001721 


21359831 


660 


D-0031 06-08 


GTA Ar^f^PTriTP A ATfi ATA A 
^ 1 rvAvjov^ 1 Vj 1 oMA 1 oA 1 AA 


o 


BTK 














BTK 


NM 000061 


4557376 


695 


D-0031 07-05 


GAAPAf^(^AAT(^riAAr*PTTA 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-06 


GPTATnr^fiPTnPP A A AIM 


D 


BTK 


NM 000061 


4557376 


695 


D-0031 07-07 


RAAAnPAAPTTAPPATrir2X 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-08 


GfiTAAAPQATPAArir^AOTT 
VJV3 1 rV-V\VyUM 1 V^AAIJOAO 1 1 


6. 


C20orf64 














C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-05 


CAAPTTAnPP A A^i AP A ATT 
y^rKTW^ 1 1 r\\Jv#\»/rV\OAOAA 1 \ 


6. 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-06 


G A A ATTr^ A A npTP A r::Tri A 


D. 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-07 


TGf^AAPAPiPTri A APATTriT 

' VJVJrVAOMOV-/ 1 vJMMv/M 1 1 1 


6. 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-08 


GPTTPP A A PTr^ PTT AT A T A 


D. 


CSF1R 














ObrlR 


NM 005211 


27262658 


1436 


D-0031 09-05 


GGAGAGCTCTGACGTTTGA 


6: 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-06 


CAACAACGCTACCTTCCAA 


6: 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-07 


CCACGCAGCTGCCTTACAA 


6; 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-08 


GGAACAACCTGCAGTTTGG 


6. 


CSK 















AW 



CSK 


NM 004383 


4758077 


1445 


D-0031 10-05 


CAGAATGTATTGCCAAGTA 


6 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-06 


GAACAAAGTCGCCGTCAAG 


6 


CSK 


NM_004383 


4758077 


1445 


D-0031 10-07 


GCGAGTGCCTTATCCAAGA 


6 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-08 


GGAGAAGGGCTACAAGATG 


6 


DDR1 














DDR1 


NM 013994 


7669484 


780 


D-0031 11-05 


GGAGATGGAGI 1 IGAGrTT 


6 


DDR1 


NM 013994 


7669484 


780 


D-0031 11 -06 


CAGAGGCCCTGTCATCTTT 


6 


DDR1 


NM 013994 


7669484 


780 


D-003111-07 


GCTGGTAGCTGTCAAGATC 


6 


DDR1 


NM 013994 


7669484 


780 


D-003111-08 


TGAAAGAGGTGAAGATCAT 


6 


DDR2 














DDR2 


NM 006182 


5453813 


4921 


D-0031 12-05 


GGTAAGAACTACACAATCA 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-06 


GAACGAGAGTGCCACCAAT 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-07 


ACACCAATCTGAAGTTTAT 


6 


DDR2 


NM_006182 


5453813 


4921 


D-003112-08 


CAACAAGAATGCGAGGAAT 


6 


DKFZp761 
P1010 














DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13-05 


CCTAGAAGCTGCCATTAAA 


6 


DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13^6 


GATTAGGCCTGGCTTATGA 


6 


DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13-07 


CCCAGTAGCTGCACACATA 


6 


DKFZp761 
rlOlO 


NM_018423 


8922178 


55359 


0^)03113-08 


GGTGGTACCTGAACTGTAT 


6 


EGFR 














EGFR 


NM 005228 


4885198 


1956 


D-0031 14-05 


GAAGGAAACTGAATTCAAA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-06 


GGAAATATGTACTACGAAA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-07 


CCACAAAGCAGTGAATTTA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-08 


GTAACAAGCTCACGGAGTT 


6 


EPHA1 














EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-05 


GACCAGAGCrrCACCATTC 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-06 


GCAAGACTGTGGCCATTAA 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-003115-07 


GGGCGAACCTGACCTATGA 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-08 


GATTGTAGCCGTCATCTTT 


6 


EPHA2 














EPHA2 


NM 004431 


4758277 


1969 


D-0031 .16-05 


GGAGGGATCTGGCAACTTG 


6 


EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-06 


GCAGCAAGGTGCACGAATT 


6 


EPHA'2 


NM 004431 


4758277 


1969 


D-003116-07 


GGAGAAGGATGGCGAGTTC 


"^6 


EPHA2 


NM 004431 


4758277 


1969 


D-003116-08 


GAAGTTCACTACCGAGATC 


6 


EPHA3 














EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-05 


GATCGGACCTCCAGAAATA 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-06 


GAACTCAGCTCAGAAGATT 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-07 


GCAAGAGGCACAAATGTTA 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-08 


GAGCATCAGTTTACAAAGA 


6 


EPHA4 














EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-05 


GGTCTGGGATGAAGTATTT 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-06 


GAATGAAGTTACCTTATTG 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-07 


GAACTTGGGTGGATAGCAA 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-08. 


GAGATTAAATTCACCTTGA 


6 


EPHA7 














EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-05 


GAAAAGAGATGTTGCAGTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-06 


CTAGATGCCTCCTGTATTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-07 


AGAAGAAGGTTATCGTTTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-08 


TAGCAAAGCTGACCAAGAA 


6 


EPnAo 














EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-05 


GAAGATGCACTATCAGAAT 


6 



II wv 



EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-06 


GAGAAGATGCACTATCAG5A 




EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-07 


AACCTGATCTCGAGTGTGA 




EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-08 


TCTCAGACCTGGGCTATGT 




EPHB1 














EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -05 


GCGATAAGCTCCAGCATTA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -06 


GAAACGGGCTTATAGCAAA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -07 


GGATGAAGATCTACATTGA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -08 


GCACGTCTCTGTCAACATC 




EPHB2 














EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-05 


ACTATGAGCTGCAGTACTA 




EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-06 


GTACAACGCCACAGCCATA 




EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-07 


GGAAAGCAATGACTGTTCT 




EPHB2 


NM_017449 


17975764 


2048 


D-0031 22-08 


CGGACAAGCTGCAACACTA 




EPHB3 














EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-05 


GGTGTGATCTCCAATGTGA 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-06 


GGGATGACCTCCTGTACAA 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-07 


CAGAAGACCTGCTCCGTAT 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-08 


GAGATGAAGTACTTTGAGA 




EPHB4 














EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-05 


GGACAAACACGGACAGTAT 




EPHB4 


NM 004444 


17976769 


2050 


D-0031 24-06 


GTACTAAGGTGTACATCGA 




EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-07 


GGAGAGAAGCAGAATATTC 




EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-08 


GCCAATAGCCACTCTAACA 




EPHB6 














EPHB6 


NM_ 004445 


4758291 


2051 


D-0031 25-05 


GGAAGTCGATCCTGCTTAT 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-06 


GGACCAAGGTGGACACAAT 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-07 


TGTGGGAAGTGATGAGTTA 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-08 


CGGGAGACCTTCACCCTTT 




ERBB2 














ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-05 


GGACGAATTCTGCACAATG 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-06 


GACGAATTCTGCACAATGG 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-07 


CTACAACACAGACACGTTT 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26.0&' 


AGACGAAGCATACGTGATG 




ERBB3 








- 






ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-05 


AAGAGGATGTGAACGGTTA 




ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-06 


GAAGACTGCCAGACATTGA 




ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-07 


GACAAACACTGGTGCTGAT 




ERBB3,. 


NM 001982 


4503596 


2065 


D-0031 27-08 


GCAGTGGATTCGAGAAGTG 




ERBB^l 














ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-05 


GAGGAAAGATGCCAATTAA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-06 


GCAGGAAACATCTATATTA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-07 


GATCACAACTGCTGCTTAA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-08 


CCTCAAAGATACCTAGTTA 




PER 












— 


PER 


NM 005246 


4885230 


2241 


D-0031 29-05 


GGAGTGACCTGAAGAATTC 




PER 


NM 005246 


4885230 


2241 


D-0031 29-06 


TAAAGCAGATTCGGATTAA 




PER 


NM 005246 


4885230 


2241 


D-0031 29-07 


GGAAAGTACTGTCCAAATG 




PER 


NM 005246 


4885230 


2241 


D-0031 29-08 


GAACAACGGCTGCTAAAGA 




FES 














PES 


NM 002005 


13376997 


2242 


D-0031 30-05 


CGAGGATCCTGAAGCAGTA . 




PES 


NM 002005 


13376997 


2242 


D-0031 30-06 


AGGAATACCTGGAGATTAG 




FES 


NM 002005 


13376997 


2242 


D-0031 30-07 


CAACAGGAGCTCCGGAATG 




FES 


NM 002005 


13376997 


2242 


D-0031 30-08 


GGTGTTGGGTGAGCAGATT 




PGFR1 














FGPR1 


MIVi \J\J\J\j\J*f 




22b0 


D-0031 31 -05 


TAAGAAATGTCTCCTTTGA 




FGFR1 


NM_000604 


13186232 


2260 


D-0031 31-06 


GAAGACTGCTGGAGTTAAT 





FGFR1 


NM 000604 


13186232 


2260 


D-0031 31-07 


GATGGTCCCTTGTATGTCA 


7 


FGFR1 


NM 000604 


13186232 


2260 


D-003131-08 


CTTAAGAAATGTCTCCTTT 


7 


FGFR2 














FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-05 


CCAAATCTCTCAACCAGAA 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-06 


GAACAGTATTCACCTAGTT 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-07 


GGCCAACACTGTCAAGTTT 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-08 


GTGAAGATGTTG AAAGATG 


7 


FGFR3 














FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-05 


TGTCGGACCTGGTGTCTGA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-06 


GCATCAAGCTGCGGCATCA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-07 


GGACGGCACACCCTACGTT 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-08 


TGCACAACCTCGACTACTA 


7 


FGFR4 














FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-05 


GCACTGGAGTCTCGTGATG 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-06 


CATAGGGACCTCTCGAATA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-07 


ATACGGACATCATCCTGTA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-08 


ATAGGGACCTCTCGAATAG 


7 


FGR 














FGR 


NM 005248 


4885234 


2268 


D-0031 35-05 


GCGATCATGTGAAGCATTA 


7 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-06 


TCACTGAGCTCATCACCAA 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-07 


GAAGAGTGGTACTTTGGAA 


-f 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-08 


CCCAGAAGCTGCCCTCTTT 




FLT1 














FLT1 


NM 002019 


4503748 


2321 


D-0031 36-05 


GAGCAAACGTGACTTATTT 


-f 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-06 


CCAAATGGGTTTCATGTTA 




FLT1 


NM 002019 


4503748 


2321 


D-0031 36-07 


CAACAAGGATGCAGCACTA 


'f 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-08 


GGACGTAACTGAAGAGGAT 


-1 


FLT3 














FLT3 


NM 004119 


4758395 


2322 


D-0031 37-05 


GAAGGCATCTACACCATTA 


-f 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-06 


GAAGGAGTCTGGAATAGAA 


-1 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-07 


GAATTTAAGTCGTGTGTTC 


-f 


FLT3 


NM 004119 


4758395 


2322 


D-0031 37-08 


GGAATTCATTTCACTCTGA 




FLT4 














FLT4 


NM 002020 


4503752 


2324 


D-0031 38-05 


GCAAGAACGTGCATCTGTT 


^ 


FLT4 


NM 002020 


4503752 


2324 


D-0031 38-06 


GCGAATACCTGTCCTACGA 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-07 


GAAGACATTTGAGGAATTC 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-08 


GAGCAGCCATTCATCAACA 


- 


FRK., 








- 






FRK *- 


NM 002031 


4503786 


2444 


D-0031 39-05 


GAAACAGACTCTTCATATT 


- 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-06 


GAACAATACCACTCCAGTA 




FRK 


NM 002031 


4503786 


2444 


D-0031 39-07 


CAAGACCGGTTCCTTTCTA 


- 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-08 


GCAAGAATATCTCCAAAAT 


- 


FYN 














FYN 


NM 002037 


23510344 


2534 


D-0031 40-05 


GGAATGGACTCATATGCAA 


- 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-06 


GCAGAAGAGTGGTACTTTG 


- 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-07 


CAAAGGAAGTTTACTGGAT 


- 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-08 


GAAGAGTGGTACTTTGGAA 


- 


HCK 














HCK 


NM 002110 


4504356 


3055 


D-0031 41 -05 


GAGATACCGTGAAACATTA 


• 


HCK 


NM 002110 


4504356 


3055 


D-0031 41 -06 


GCAGGGAGATACCGTGAAA 


- 


HCK 


NM 002110 


4504356 


3055 


D-0031 41 -07 


CATCGTGGTTGCCCTGTAT 


- 


HCK 


NM 002110 


4504356 


3055 


D-0031 41 -08 


TGTGTAAGATTGCTGACTT 


- 


ITK 














ITK 


NM 005546 


21614549 


3702 


D-0031 44-05 


CAAATAATCTGGAAACCTA 


■; 


ITK 


NM 005546 


21614549 


3702 


D-003144-06 


GAAGAAACGAGGAATAATA 




ITK 


NM 005546 


21614549 


3702 


D-003144-07 


GAAACTCTCTCATCCCAAA 





Ill 



ITK 


NM 005546 


21614549 


3702 


D-0031 44-08 


GGAATGGGCATGAAGGATA 


7 


JAK1 














JAK1 


NM 002227 


4504802 


3716 


D-0031 45-05 


CCACATAGCTGATCTGAAA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-06 


TGAAATCACTCACATTGTA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-07 


TAAGGAACCTCTATCATGA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-08 


GCAGGTGGCTGTTAAATCT 


7 


JAK2 














JAK2 


NM 004972 


13325062 


3717 


D-0031 46-05 


GCAAATAGATCGAGTTC7T 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-06 


GAGCAAAGATCCAAGACTA 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-07 


GCCAGAAACTTGAAACTTA 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-08 


GTACAGATTTCGCAGATTT 


7 


JAK3 














JAK3 


NM 000215 


4557680 


3718 


D-0031 47-05 


GCGCCTATCTTTCTCCTTT 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-06 


CCAGAAATCGTAGACATTA 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-07 


CCTCATCTCTTCAGACTAT 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-08 


TGTACGAGCTCTTCAGCTA 


7 


KDR 














KDR 


NM 002253 


11321596 


3791 


D-0031 48-05 


GGAAATCTCTTGCAAGCTA 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-06 


GATTACAGATCTCCATTTA 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-07 


GCAGACAGATCTACGTTTG 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-08 


GCGATGGCCTCTTCTGTAA 


7 


KIAA1079 














KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-05 


GAAATTCTCTCAACTGATG 


7 


KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-06 


GCAGAGGTCTTCACACTTT 


7 


K1AA1079 


NM 014916 


7662475 


22853 


D-0031 49-07 


TAAATGATCTTCAGACAGA 


7 


KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-08 


GAGCAGCCCTACTCTGATA 


7 


KIT 














KIT 


NM 000222 


4557694 


3815 


D-0031 50-05 


AAACACGGCTTAAGCAATT 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-06 


GAACAGAACCTTCACTGAT 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-07 


GGGAAGCCCTCATGTCTGA 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-08 


GCAATTGCAnTATGTGTT 


7 


LCK 














LCK 


NM 005356 


20428651 


3932 


D-0031 51 -05 


GAAeTGCCATTATCGCATA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51-06 


GAGAGGTGGTGAAACATTA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -07 


GGGCCAAGTTTCCCATTAA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -08 


GGACGCTGCTCATCCGAAA 


7 


LTK 














LTK 


NM 002344 


4505044 


4058 


D-0031 52-05 - 


TGAATTCACTCCTGCCAAT 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-06 


GTGGCAACCTCAAGACTGA 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-07 


GGAGCTAGCTGTGGATAAC 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-08 


GCAAGTTTCGCCATCAGAA 


7 


LYN 














LYN 


NM 002350 


4505054 


4067 


D-0031 53-05 


GCAGATGGCTTGTGCAGAA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-06 


GGAGAAGGCTTGTATTAGT 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-07 


GATGAGCTCTATGACATTA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-08 


GGTGCTAAGTTCCCTATTA 


7 


MATK 














MATK 


NM 002378 


21450841 


4145 


D-0031 54-05 


TGAAGAATATCAAGTGTGA 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-06 


CCGCTCAGCTCCTGCAGTT 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-07 


TACTGAACCTGCAGCATTT 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-08 


TGGGAGGTC7TCTCATATG 


8 


MERTK 














MERTK 


NM 006343 


5453737 


10461 


D-0031 55-05 


GAACTTACCTTACATAGOT 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-06 


GGACCTGCATACTTACTTA 


8 


MFRTK 


MM nn^o/i Q 
IMIVI uuoo4o 


0453737 


10461 


D-0031 55-07 


TGACAGGAATCTTCTAATT 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-08 


GGTAATGGCTCAGTCATGA 


8 



El 



HZ 



MET 














MET 


NM 000245 


4557746 


4233 


D-0031 56-05 


GAAAGAACCTCTCAACATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-06 


GGACAAGGCTGACCATATG 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-07 . 


CCAATGACCTGCTGAAATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-08 


GAGCATACATTAAACCAAA 


8 


MST1R 














MST1R 


NM 002447 


4505264 


4486 


D-0031 57-05 


GGATGGAGCTGCTGGCTTT 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-06 


CTGCAGACCTATAGATTTA 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-07 


GCACCTGTCTCACTCTTGA 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-08 


GAAAGAGTCCATCCAGCTA 


8 


MUSK 














MUSK 


NM 005592 


5031926 


4593 


D-0031 58-05 


GAAGAAGCCTCG6CAGATA 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-06 


GTAATAATCTCCATCATGT 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-07 


GGAATGAACTGAAAGTAGT 


8 


MUSK 


NM_005592 


5031926 


4593 


D-0031 58-08 


GAGATTTCCTGGACTAGAA 


8 


NTRK1 














NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-05 


GGACAACCCTTTCGAGTTC 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-06 


CCAGTGACCTCAACAGGAA 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-07 


CCACAATACTTCAGTGATG 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-08 


GAAGAGTGGTCTCCGTTTC 


8 


NTRK2 














NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-05 


GAACAGAAGTAATGAAATC 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-06 


GTAATGCTGTTTCTGCTTA 


8 


NTRK2 


NM_006180 


21361305 


4915 


D-003160-07 


GCAAGACACTCCAAGTTTG 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-08 


GAAAGTCTATCACATTATC 


8 


NTRK3 














NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -05 


GAGCGAATCTGCTAGTGAA 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -06 


GAAGTTCACTACAGAGAGT 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61-07 


GGTCGACGGTCCAAATTTG 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61-08 


GAATATCACTTCCATACAC 


8 


PDGFRA 














PDGFRA 


NM 006206 


15451787 


5156 


D-003162-05 


GAAACTTCCTGGACTATTT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-003162-06 


GAGATIsrGGTCAACIAI 1 1 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-07 


GCACGCCGCTTCCTGATAT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-08 


CATCAGAGCTGGATCTAGA 


8 


PDGFRB 














PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-05 


GAAAGGAGACGTCAAATAT 


8 


PDGFRB 


NM .002609 


15451788 


5159 


D-0031 63-06 


GGAATGAGGTGGTCAACTT 


8 


PDGFRB 


NM '002609 


15451788 


5159 


D-0031 63-07 


CAACGAGTCTCCAGTGCTA 


8 


PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-08 


GAGAGGACCTGCCGAGCAA 


8 


PTK2 














PTK2 


NM 005607 


27886592 


5747 


D-0031 64-05 


GAAGTTGGGTTGTCTAGAA 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-06 


GAAGAACAATGATGTAATC 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-07 


GGAAATTGCTTTGAAGTTG 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-08 


GGTTCAAGCTGGATTATTT 


8 


PTK2B 














PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-05 


GAACATGGCTGACCTCATA 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-06 


GGACCACGCTGCTCTATTT 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-003165-07 


GGACGAGGACTATTACAAA 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-003165-08 


TGGCAGAGGTCATCAACAA 


& 


PTK6 














PTK6 


NM 005975 


27886594 


5753 


D-0031 66-05 


GAGAAAGTCCTGCCCGTTT 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-06 


TGAAGAAGCTGCGGCACAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-07 


CGGCGACTCTGATGAGAAA 


a 


PTKR 


INlVI UUOU/D 


^7886594 


5753 


D-0031 66-08 


TGCCCGAGCTTGTGAACTA 


8 


PTK7 















PTK7 


NM 002821 


27886610 


5754 


D-0031 67-05 


GAGAGAAGCCCACTATTAA 1 ft 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-06 




8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-07 


GGAGGfi AfiTTriri A Axr^TT 

V->v-»/-\\JVJV3Mv3 1 1 ooAoA loll 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-08 


GAAGAnATnpPi^PTATTTr^ 


o 
O 


PTK9 














PTK9 


NM 002822 


4506274 


5756 


D-0031 68-05 


G AAG A AHT A CC^An a n att a 

vi»rv\vjAVA^ 1 MV>OAv^AoA 1 1 A 


o 
o 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-09 


\jrv/Av:7V3MOAvjr ! A II 1 AoAo 1 


o 
O 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-10 


G AGnnr^ ATrir'TriT ATTr^T A 
vj/nvjv-rov3/-\ 1 vjO I o I a I 1 U 1 A 


o 
O 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-11 


CTnn Arj APTTr*r*TTT at/^ a 
^ * vJwAWJMO 1 1 OL» 1 i 1 A 1 IjA 


8 


PTK9L 














PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-05 


AG AG Ari AriPTPO A rtn A o A X 

r^^rVJr\\Dr\\3K^ 1 oOAoOAoA i 


Q 

0 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-06 


TTAAnf^ Af^l^Tri AA/^Ar^Ar^A 
• • mmv^umoVp I oAAo AoAo A 


o 
0 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-07 


A P A r* A A P O P A r* A X T 


o 

a 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-08 


GHTri C^C^ ATP AfZHA PT Axr» a 
1 1 OAooAU 1 A 1 IjA 


o 
o 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


(nPAAAf^ Ar*PTriP AP A ap at 
vjo/A/AAoAOo 1 ovjjAvjAAVjA 1 


o 
0 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GPAPAPf^f^PTriPATP AP A A 
VJV^A\V-»A\0OV30 1 oOA 1 v^AoAA 


o 
C 


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


n A A PTHif^PPTnri A A A r^T 
1 ooow/ 1 ovaAoAoAo 1 


Q 

c 


RET 


NM 000323 


21536316 


5979 


D-0031 70-08 


It A A ATf^f^ ATririP A ATTr* A 


o 

c 


R0R1 




■- 










R0R1 


NM 005012 


4826867 


4919 


D-0031 71 -05 


vjOMAoUA 1 Lr 1 1 1 AU 1 AooA 


fi 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71-06 


oMooAAvjoo 1 AAAoACjO fa 




R0R1 


NM 005012 


4826867 


4919 


D-0031 71-07 


r^AriAPlPA APTXPATPTA A A 

oaoavjoaau 1 1 UA 1 o 1 AAA 


8 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71 -08 


V3MOMA 1 o J 1 1:1 1 o 1 LrAAA 


C 


R0R2 














R0R2 


NM 004560 


19743897 


4920 


D-0031 72-05 


f5r^AAPTPmPTPPTr*/^r^TAT 
ooAAo 1 OoO 1 vjU 1 oOU ! AT 




ROR2 


NM 004560 


19743897 


4920 


D-0031 72-06 


ooAvJo 1 1 1 OAoA 1 v3 




R0R2 


NM 004560 


19743897 


4920 


D-0031 72-07 


P A ATr^TriPT APTT^T" A r^/^ A 

V30AA 1 V3 1 vjo 1 Ao 1 vj 1 AC/(3A 




R0R2 


NM 004560 


19743897 


4920 


D-0031 72-08 


fliA Ant APAfi A AX AX/^/^TT/^ A 

oaaVjALtAvj AA 1 A 1 oCjTTCA 




R0S1 














R0S1 


NM 002944 


19924164 


6098 


D-0031 73-05 


A 1^ A A PPXXPXX A /^XX A 

oAov^AoALrO 1 1 U 1 1 AOTTA 




R0S1 


NM 002944 


19924164 


6098 


D-0031 73-06 


T T A P A A rf^ xxr* A p A XX A 
1 1 Av-fAoAv3o 1 1 OAooA 1 1 A 


€ 


ROS1 


NM 002944 


19924164 


6098 


D-0031 73-07 


^^A Ar* A A APP>XA APO AX/^ A A 
U>AAUAAAV^V> 1 AAoUA 1 (jAA 




R0S1 


NM 002944 


19924164 


6098 


D-0031 73-08 


(nAAA(^Af^PAr*XXP A A ATA A 
V3AAAV3 AVjU»AO 1 1 U AAA 1 AA 


— 


RYK 














RYK 


NM 002958 


11863158 


6259 


D-0031 74-05 


^^A A Af^ AXr^i^XXAPPr^ A AX A 
oAAAoA 1 VjVj 1 1 ALrUoAA 1 A 




RYK 


NM 002958 


11863158 


6259 


D-0031 74-06 


PAA AntXA^ AXXPTr^ a a/^xx 
^✓MMAo 1 AvD A 1 1 U I oAAo 1 1 




RYK 


NM 00295a. 


1.1863158 


6259 


D-0031 74-07 


TP A PT A P/^PTPT AXP PXXX 


— 


RYK 


NM 002958 


11863158 


6259 


D-0031 74-08 


ril^Xr;! A A^/^ AXAXA/^r* A AX A 

oo 1 oAAoinA 1 A 1 AO>UAATA 


£ 


SRC 














SRC 


NM 005417 


21361210 


6714 


D-0031 75-05 


ntAr^A Ar*r*TPr*xr*xp/^ A a a /-» 
vjAoAALrLr 1 Vjvj 1 Kj 1 oLfAAAG 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-09 


Ar^ Ar^A Appx<^r*xpxr*r*A a 

OMoAOAAOv^ 1 1 o 1 oLfAA 


-J 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-10 


c^aA \ I rripx/^p a pxxxr^x 


SRC 


NM 005417 


21361210 


6714 


D-0031 75-11 


C^AAACXTC^AntAr^r^Ar^n A A An 
OMAAO 1 v^AoAUUAOoAAAC? 




SYK 












SYK 


NM 003177 


21361552 


6850 


D-0031 76-05 


GG A ATA ATPTP A A r2 A A Tr* a 

VJOAAM 1 MM 1 1 OAAo AA 1 L»A 




SYK 


NM 003177 


21361552 


6850 


D-0031 76-06 


HAAPTf^ (^(^PTPTrinTA AXX 


— 


SYK 


NM 003177 


21361552 


6850 


D-0031 76-07 


r^r^AA/^A AXPXP APr* A A AXX 
oOMAoAA 1 1 oAoUAAA 1 1 




SYK 


NM 003177 


21361552 


6850 


D-0031 76-08 


A AP A A r* Ax/^xr* A A p A X 

OMMOMOAWA 1 v3 1 LrAAooA 1 




TEC 














TEC 


NM 003215 


4507428 


7006 


D-0031 77-05 


A A ATTriTPT A CtT A A nTn A 
V3MMM i 1 o 1 O 1 Ao 1 AAvj 1 oA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-06 


P APPT^iA Ar^T^iXTTA AXXA 
OMOO I oMAo 1 o M 1 AA 1 1 A 


— 


TEC 


NM 003215 


4507428 


7006 


D-0031 77-07 


GTACAAAGTCGCAATCAAA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 




TPK 














TEK 


NM_000459 


4557868 


7010 


D-0031 78-05 


GAAAGAATATGCCTCCAAA 





ii4 



TFK 


MM nnoATO 

i>livi 


fOO / OOO 




r\ AAO'i 70 AO 

L/-OO0I /8-Od 


GGAATGACATCAAATTTCA 


8 


TFK 




40DAO0O 


TA^ A 


U-003 178-07 


TGAAGTACCTGATATTCTA 


8 


TFK" 


IMIVI UUUnOy 


400/oDo 


7 A 4 A 


D-0031 78-08 


CGAAAGACCTACGTGAATA 


8 


TIF 














TIF 


MM C\f\RAOA 


AQ.Qt^f^'^n 


7A7C 


u-0031 79-05 J 


GAGAGGAGGl 1 1 ATGTGAA 


8 


TIF 
1 it— 


MM n(\'^A'?A 


4ooOOoU 




U-U031 79-06 


GGGACAGCCTCTACCCTTA 


8 


TIF 






7ATK 


U-OO01 79-07 


GAAGTTCTGTGCAAATTGG 


8 


TIF 

1 ILI 


MM CiTiCiAOA 


4ooODoU 


7A7C 


D-0031 79-08 


CAACATGGCCTCAGAACTG 


g 


TNK1 














TNK1 

1 INIX 1 


MM nn^QAR 


4DUr D lU 


074 A 


D-0031 80-05 


GTTCTGGGCCTAAGTCTAA 


g 




inivi uuoyoo 


yf CATC -I A 


o711 


D-0031 80-06 


GAACTGGGTCTACAAGATC 


g 




IMIvl uuoyoD 


4DU/D1U 


o71 1 


0-003180-07 


CGAGAGGTATCGGTCATGA 


g 




INIVI uuoyoo 


A CATf^-l A 


O/l 1 


D-0031 80-08 


GGCGGATCCTGGAGCATTA 


S 


I /\t\ 














TXK 


MM nn'^'^oft 


40U/ /4^ 


/zy4 


D-0031 81 -05 


GAACATCTATTGAGACAAG 


c 


TXK 


MM nn'^'^OA 

INIVI UUOO^O 


40U / /4^ 


/294 


D-0031 81 -06 


TCAAGGCACI 1 lATGAI 1 1 




TXK 


MM nn^'^Oft 

INIVI 


40U/ /4Z 


/ 48:94 


D-0031 81 -07 


GGAGAGGAATGGGTATATT 




TXK 

1 A.i\ 


INIVI uuoo^o 


4oU/ /4^ 


7294 


D-0031 81-08 


G6ATATATGTGAAGGAATG 




TYK9 














TYK9 


MM nn^^'^'i 

INIVI UUOoOl 


/IKA77/1Q 

4oU//4o 


7on7 


rx f\f\^A 

D-0031 82-05 


GAGGAGATCCACCACI 1 lA 




TYK9 


INIVI UUOOO 1 


ylCA77>l O 

4o07/4o 


7297 


D-0031 82-06 


GCATCCACATTGCACATAA 




TYK9 


MM n/T^QQi 
INm UUooo 1 


4o0774o 


7297 


D-0031 82-07 


TCAAATACCTAGGCACACT 


£ 


TYK9 


INIVI UUOOO 1 


450774o 


7297 


rx t\^f\A Aft AM 

D-0031 82-08 


CAATCTTGCTGACGTCTTG 




1 T r\WO 














TYRO? 


INIVI uuo^yo 




7301 


D-0031 83-05 


GGTAGAAGGTGTGCCATTT 


£ 


TYRO'^ 

1 T iaV-^O 


iNivi uuD^iyo 




7301 


D-0031 83-06 


AGGCTGAGATTTACAACTA 




TYRO*^ 


INIVI UUD/iyo 


^7597077 


7301 


D-0031 83-07 


GGATGGCTCCTTTGTGAAA 


c 

V 


TYRH*^ 


INIVI uuo^yo 


il/ 097077 


7301 


D-0031 83-08 


GAGAGGAACTACGAAGATC 




YF^1 














YF^1 


INIVI UU0400 


O-! AT-I A>l 4 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 


£ 


T CO 1 


INIVI UU040o 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 




YES1 


NM 005433 


21071041 


7525 


D-00'^1 84-07 


TP A A fi A A r^r'Tr^ A m AT A ATr^ 


r 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-08 


CAGAATCCCTGeATGAATT 





Table Vm 



Gene 
Name 




"Gl 


Locus 
Link 


Duplex # 


Full Sequence 


SEQ. ![ 
NO. 


APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATCAA 


921 


APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APC2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-003200-09 


CAACACGTGTGACATCATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAACAAAT 


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAATGMGCI 1 ICIGAAI 1 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCTGAAGTCTTATTTAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACACTGCTGGTTTG 


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


20143978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM 001184 


20143978 


545 


D-003202-08 


CAACATAAATCCAAGAAGA 


932 


BTAK 















lis 



D 1 Mix 


MM nri'^finn 


ozlolyb 


6790 


D-003545-04 


CAAAGAATCAGCTAGCAAA 


933 




InIVI UUoDUU 


O/llOiyD 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


RTAK 

LJ 1 r\l\ 


MM nn'^Ron 

INIVI UUOOUU 


ozioiyo 


6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


935 


o 1 r\u 


IMIVl UUOOUU 


Qo-i o^ ne 

OZ loiyo 


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 


PPMA1 














PPWA1 


INI VI uuoyi4 


1 boObozo 


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 




IMIVI UUoyi4 


1 boOoozo 


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 




INIVI uuoy 14 


1 boOb52o 


8900 


D-003204-07 


TCACAAGAATCAGGTGTTA 


939 




IMIvl UUoy 14 


1 b30o528 


8900 


D-003204-08 


CATAAAGCGTACCTTGATA 


940 


PPM A 9 














PPMA9 


KitVA nn4 00*7 


1 oyoOoDo 


890 


D-003205-05 


GCTGTGAACTACATTGATA 


941 


PPM A 9 


NM Uu1z37 


16950653 


890 


D-003205-06 


GATGATACCTACACCAAGA 


942 


PPMA9 


INIVI UUi^o/ 


10950653 


890 


D-003205-07 


GCTGTTAGCCTCAAAGTTT 


943 


PPNA9 


MM nn*! 007 
INM yjVi^of 


16950653 


890 


D-003205-08 


AAGCTGGCCTGAATGATTA 


944 


PPMR1 














PPMR1 


iMIvl UoiyuD 


14327895 


891 


D-003206-05 


CAACATTACCTGTCATATA 


945 


PPWR1 


iNivi Uoiybb 


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT 


946 . 


PPMR1 


INIVI uoiyoD 


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 


PPMR1 


INM uoiyou 


14327895 


891 


D-003206-08 


GCACCTGGCTAAGAATGTA 


948 


PP MR9 














PPMR9 


iNm UU4/U1 


10938017 


9133 


D-003207-05 


CAACAAATGTCAACAAACA 


949 


PPMR9 


rim UU4/U1 


10938017 


9133 


D-003207-06 


GCAGCAAACTCCTGAAGAT 


950 


PPMR9 


KIR if f\f\A'7i\A 

InM UU4f01 


10938017 


9133 


D-003207-07 


CCAGTGATTTGGAGAATAT 


951 


PPKIR9 


rim UU4/01 


10938017 


9133 


D-003207-08 


GTGACTACGTTAAGGATAT 


952 


PPMR*^ 
















INM KJOoUoi 


14719419 


85417 


D-003208-05 


TGAACAAACTGCTGACTTT 


953 




IMIVI UooUol 


14719419 


85417 


D-003208-06 


GCTAGCTGCTGCCTCCTTA 


954 




INIVI UooUoi 


14719419 


85417 


D-003208-07 


CAACTCACCTCGTGTGGAT 


955 


PPMR? 


INIVI UooUol 


14719419 


85417 


D-003208-08 


GTGGATCTCTACCTAATGA 


956 


PPMP 














PPMP 


INIVI UUDiyU 


7382485 


892 


D-003209-05 


GCAGAGCTCCCACTATTTG 


957 


PPMP 


INIVI uuoiyu 


7382485 


892 


D-003209-06 


GGAGTAGTTTCAAATACAA 


958 


PPMP 


Nivi uuoiyu 


7382485 


892 


D-003209-07 


GACCTTTGCTCCAGTATGT 


959 


PPMP 


iNivi_^uuoiyu 


7382485 


892 


D-003209-08 


GAGATTCTATGCCAGGTAT 


960 


ppMni 














PPMHI 


iNlVf UdoUDO 


16950654 


595 


D-003210-05 


TGAACAAGCTCAAGTGGAA 


961 




NM UooOOD 


16950654 


595 


D-003210-06 


CCAGAGTGATCAAGTGTGA 


962 


PPMHi 


INIVI UOOUOD 


16950654_ 


595 


D-003210-07 


GTTCGTGGCCTCTAAGATG 


963 




InM udoUod 


16950654 


595 


D-003210-08 


CCGAGAAGCTGTGCATCTA 


964 


PP Mn9 














PP Mn9 


iNM UUj /oy 


1 b950656 


894 


D-003211-06 


TGAATTACCTGGACCGTTT 


965 


pPMn9 


VIM nrn7cn 
INM UUi/oy 


1 6950656 


894 


D-003211-07 


CGGAGAAGCTGTGCATTTA 


966 


ppMn9 


INM uui/oy 


16950656 


894 


D-003211-08 


CTACAGACGTGCGGGATAT 


967 


ppMn9 


NM UUT/oy 


1 6950656 


894 


D-003211-09 


CAACACAGACGTGGATTGT 


968 


PPMH*^ 














pPMn*^ 


ViRil HA-I 7£5n 

INM UU 1 /oU 


1 6950657 


896 


D-003212-05 


GGACCTGGCTGCTGTGATT 


969 




INIVI UU 1 /OU 


1 b950657 


896 


D-003212-06 


GATTATACCTTTGCCATGT 


970 


PPMR*^ 


NM UUl/oU 


16950657 


896 


D-003212-07 


GACCAGCACTCCTACAGAT 


971 


PPMH*^ 


INM UU1 /OO 


16950657 


896 


D-003212-08 


TGCGGAAGATGCTGGGTTA 


972 


PPMF1 














PPNP1 


NIVI UU I^OO 


1 731 8558 


898 


D-003213-05 


GTACTGAGCTGGGCAAATA 


973 


PPMPI 


INM UUIidoO 


17318558 


898 


D-003213-06 


GGAAATCTATCCTCCAAAG 


974 


PPMF1 


INIVI UUliCOO 


4 TO .4 O ^ ^ 

17318558 


898 


D-0032 13-07 


GGAGGTGTGTGAAGTCTAT 


975 


PPMF1 


MM nn*! OQO 
INM UUl^lOO 


H TO >l o ^ r"i^ 

1 731 8558 


898 


D-003213-08 


CTAAATGACTTACATGAAG 


976 


CCNE2 














CCNE2 


NM 057749 


17318564 


9134 


D-003214-05 


GGATGGAACTCATTATATT 1 


977 



no 



CCNE2 


NM 057749 


17318564 


9134 


D-003214-06 


GCAGATATGTTCATGACAA 


978 


CCNE2 


NM 057749 


17318564 


9134 


D-003214-07 


CATAATATCCAGACACATA 


979 


CCNE2 


NM 057749 


17318564 


9134 


D-003214-08 


TAAGAAAGCCTCAGGTTTG 


980 


CCNF 














CCNF 


NM 001761 


4502620 


899 


D-003215-05 


TCACAAAGCATCCATATTG 


981 


CCNF 


NM 001761 


4502620 


899 


D-003215-06 


GAAGTCATGTTTACAGTGT 


982 


CCNF 


NM 001761 


4502620 


899 


D-003215-07 


TAGCCTACCTCTACAATGA 


983 


CCNF 


NM 001761 


4502620 


899 


D-003215-08 


6CACCCGGTTTATCAGTAA 


984 


CCNG1 














CCNG1 


NM 004060 


8670528 


900 


D-003216-05 


GATAATGGCCTCAGAATGA 


985 


CCNG1 


NM 004060 


8670528 


900 


D-003216-06 


GCACGGCAATTGAAGCATA 


986 


CCNG1 


NM 004060 


8670528 


900 


D-003216-07 


GGAATAGAATGTCTTCAGA 


987 


CCNG1 


NM 004060 


8670528 


900 


D-003216-08 


TAACTCAGCTTCCAACAAT 


988 


CCNG2 














CCNG2 


NM 004354 


4757935 


901 


D-003217-05 


G6AGAGAGTTGGTTTCTAA 


989 


CCNG2 


NM 004354 


4757935 


901 


D-003217-06 


GGTGAAACCTAAACATTTG 


990 


^^^^^ t^^#^ 

CCNG2 


NM 004354 


4757935 


901 


D-003217-07 


GAAATACTGAGCCTTGATA 


991 


CCNG2 


NM 004354 


4757935 


901 


D-003217-08 


TGCCAAAGTTGAAGATTTA 


992 


CCNH 














CCNH 


NM 001239 


17738313 


902 


D-003218-05 


GCTGATGACTTTCTTAATA 


993 


CCNH 


NM 001239 


17738313 


902 


D-003218-06 


CAACTTAATTTCCACCTTA 


994 


CCNH 


NM_001239 


17738313 


902 


D-003218-07 


ATACACACCTTCCCAAATT 


995 


CCNH 


NM 001239 


17738313 


902 


D-003218-08 


GCTATGAAGATGATGATTA 


996 


CCNI 














CCNI 


NM 006835 


17738314 


10983 


D-003219-05 


GCAAGCAGACCTCTACTAA 


997 


CCNI 


NM 006835 


17738314 


10983 


D-003219-07 


TGAGAGAATTCCAGTACTA 


998 


CCNI 


NM 006835, 


17738314 


10983 


D-003219-08 


GGAATCAAACGGCTCTATA 


999 


CCNI 


NM 006835 


17738314 


10983 


D-003219-09 


GAATTGGGATCTTCACACA 


1000 


CCNT1 














CCNT1 


NM 001240 


17978465 


904 


D-003220-05 


TATCAACACTGCTATAGTA 


1001 


CCNT1 


NM 001240 


17978465 


904 


D-003220-06 


GAACAAACGTCCTGGTGAT 


1002 


CCNT1 


NM 001240 


17978465 


904 


D-003220-07 


GCACAAGACTCACCCATCT 


1003 


CCNT1 


NM 001240 


17978465 


904 


D-003220-08 


GCACAGACTTCTTACTTCA • 


f1004 


CCNT2A 














CCNT2A 


NM 001241 


17978467 


905 


D-003221-05 


GCACAGACATCCTATTTCA 


1005 


CCNT2A 


NM 001241 


17978467 


905 


D-003221-06 


GCAGGGACCTTCTATATCA 


1006 


CCNT2A 


NM 001241 


17978467 


905 


D-003221-07 


GAACAGCTATATTCACAGA 


1007 


CCNT2A 


NM 001241 


17978467. 


505 


D-003221-09 


TTATATAGCTGCCCAGGTA 


1008 


CCNT2B 














CCNT2B 


NM 058241 


17978468 


905 


D-003222-05 


GCACAGACATCCTATTTCA 


1009 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-06 


GCAGGGACCTTCTATATCA 


1010 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-07 


GAACAGCTATATTCACAGA 


1011 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-08 


6GTGAAATGTACCCAGTTA 


1012 


CDC16 














CDC16 


NM 003903 


14110370 


8881 


D-003223-05 


GTAGATGGCTTGCAAGAGA 


1013 


CDC16 


NM 003903 


14110370 


8881 


D-003223-06 


TAAAGTAGCTTCACTCTCT 


1014 


CDC16 


NM 003903 


14110370 


8881 


D-003223-07 


GCTACAAGCTTACTTCTGT 


1015 


CDC16 


NM 003903 


14110370 


8881 


D-003223-08 


TGGAAGAGCCCATCAATAA 


1016 


CDC2 














CDC2 


NM 033379 


27886643 


983 


D-003552-01 


GTACAGATCTCCAGAAGTA 


1017 


CDC2 


NM 033379 


27886643 


983 


D-003552-02 


GATCAACTCTTCAGGATTT 


1018 


CDC2 


NM 033379 


27886643 


983 


D-003552-03 


GGTTATATCTCATCTTTGA 


1019 


CDC2 


NM 033379 


27886643 


983 


D-003652-04 


GAACTTCGTCATCCAAATA 


1020 


CDC20 














CDC20 


VIM 001255 


^\J\j 1 HOD 


yyi 


U-OUo^ilO-05 


GGGAATATATATCCTCTGT 


1021 . 


CDC20 


NM 001255 


4557436 


991 


D-003225-06 


GAAACGGCTTCGAAATATG 


1022 



117 



CDC20 


NM 001255 


4557436 


991 


D-003225-07 


GAAGACCTGCCGTTACATT 


1023 


CDC20 


NM 001255 


4557436 


991 


D-003225-08 


CACCAGTGATCGACACATT 


1024 


CDC25A 














CDC25A 


NM 001789 


4502704 


993 


D-003226-05 


GAAATTATGGCATCTGTTT 


1025 


CDC25A 


NM 001789 


4502704 


993 


D-003226-06 


TACAAGGAGTTCTTTATGA 


1026 


CDC25A 


NM 001789 


4502704 


993 


D-003226-07 


CCACGAGGACTTTAAAGAA 


1027 


CDC25A 


NM 001789 


4502704 


993 


D-003226-08 


TGGGAAACATCAGGATTTA 


1028 


CDC25B 














CDC25B 


NM 004358 


11641416 


994 


D-003227-05 


GCAGATACCCCTATGAATA 


1029 


CDC25B 


NM 004358 


11641416 


994 


D-003227-06 


CTAGGTCGCTTCTCTCTGA 


1030 


CDC25B 


NM 004358 


11641416 


994 


D-003227-07 


GAGAGCTGATTGGAGATTA 


1031 


CDC25B 


NM 004358 


11641416 


994 


D-003227-08 


AAAAGGACCTCGTCATGTA 


1032 


CDC25C 














CDC25C 


NM 001790 


12408659 


995 


D-003228-05 


GAGCAGAAGTGGCCTATAT 


1033 


CDC25C 


NM 001790 


12408659 


995 


D-003228-06 


CAGAAGAGATTTCAGATGA 


1034 


CDC25C 


NM 001790 


12408659 


995 


D-003228-07 


CCAGGGAGCCTTAAACTTA 


1035 


CDC25C 


NM 001790 


12408659 


995 


D-003228-08 


GAAACTTGGTGGACAGTGA 


1036 


CDC27 














CDC27 


NM 001256 


16554576 


996 


D-003229-06 


CATGCAAGCTGAAAGAATA 


1037 


CDC27 


NM 001256 


16554576 


996 


D-003229-07 


CAACACAAGTACCTAATCA 


1038 


CDC27 


NM 001256 


16554576 


996 


D-003229-08 


GGAGATGGATCCTATTTAC 


1039 


CDC27 


NM 001256 


16554576 


996 


D-003229^9 


GAAAAGCGATGATGATATT 


1040 


CDC34 














CDC34 


NM 004359 


16357476 


997 


D-003230-05 


GCTCAGACCTCTTCTACGA 


1041 


CDC34 


NM 004359 


16357476 


997 


D-003230-06 


GGACGAGGGGGATCTATAG 


1042 


CDC34 


NM 004359 


16357476 


997 


D-003230-07 


GATCGGGAGTAGACAGACA 


1043 


CDC34 


NM 004359 


16357476 


997 


D-003230-08 


TGAACGAGCCCAACACCTT 


1044 


CDC37 














CDC37 


NM 007065 


16357478 


11140 


D-003231-05 


GCGAGGAGACAGCCAATTA 


1045 


CDC37 


NM 007065 


16357478 


11140 


D-003231-06 


CACAAGACCTTCGTGGAAA 


1046 


CDC37 


NM 007065 


16357478 


11140 


D-003231-07 


ACAATCGTCATGCAATTTA 


1047 


CDC37 


NM 007065 


16357478 


11140 


D-003231-08- 


GAGGAGAAATGTGCACTCA 


1048 


CDC45L 














CDC45L 


NM 003504 


34335230 


8318 


D-003232-05 


GCACACGGATCTCCTTTGA 


1049 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-06 


GCAAACACCTGCTCAAGTC 


1050 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-07 


TGAAGAGTCTGCAAATAAA 


1051 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-08 


GGACGTGGATGGTCTGTGT 


1052 


CDC6 














CDC6 


NM 001254 


16357469 


9910' 


D-003233-05 


GAACAGAGCTGTCCGAGAT 


1053 


CDC6 


NM 001254 


16357469 


990 


D-003233-06 


GAGCAGAGATGTCCACTGA 


1054 


CDC6 


NM 001254 


16357469 


990 


D-003233-07 


GGAAATATCTTAGCTACTG 


1055 


CDC6 


NM 001254 


16357469 


990 


D-003233-08 


GGACGAAGATTGGTATTTG 


1056 


CDC7 














CDC7 


NM 003503 


11038647 


8317 


D-003234-05 


GGAATGAGGTACCTGATGA 


1057 


CDC7 


NM 003503 


11038647 


8317 


D-003234-06 


CAGGAAAGGTGTTCAGAAA 


1058 


CDC7 


NM 003503 


11038647 


8317 


D-003234-07 


CTACACAAATGCAGAAATT 


1059 


CDC7 


NM 003503 


11038647 


8317 


D-003234-08 


GTACGGGAATATATGCTTA 


1060 


CDK10 














CDK10 


NM 003674 


32528262 


8558 


D-003235-05 


GAACTGCTGTTGGGAACCA 


1061 


CDK10 


NM 003674 


32528262 


8558 


D-003235-06 


GGAAGCAGCCCTAGAACAA 


1062 


CDK10 


NM 003674 


32528262 


8558 


D-003235-07 


GCACGCCCAGTGAGAACAT 


1063 


CDK10 


NM 003674 


32528262 


8558 


D-003235-08 


GGAAGCAGCCCTAGAACAA 


1064 


CDK2 














CDK2 


NM 001798 


16936527 


1017 


D-003236-05 


GAGCTTAACGATCCTAATA 


1065 


CDK2 


NM 001798 


16936527 


1017 


D-003236-06 


GAGCTTAACCATCCTAATA 


1066 


CDK2 


NM 001798 


16936527 


1017 


D-003236-07 


GTACCGAGCTCCTGAAATC 


1067 



m 



CDK2 


NM 001798 


1o93o527 


A r\A "7 

1017 


D-003236-08 


GAGABGTGGTGGCGCTTAA 


1068 


CDK3 














CDK3 


NM 001258 


4557438 


1018 


D-003237-05 


GAGCATTGGTTGCATC 1 1 1 


1069 


CDK3 


NM 001258 


4557438 


1018 


D-003237-06 


GATCGGAGAGGGCACCTAT 


1070 


CDK3 


NM 001258 


4557438 


1018 


J-003237-07 


GAAGCTCTATCTGGTG 1 1 1 


1071 


CDK3 


NM 001258 


4557438 


1018 


D-003237-08 


GCAGAGATGGTGACTCGAA 


1072 


CDK4 














CDK4 


NM 000075 


A cr* A or* 

456426 


1019 


D-003238-05 


GCAGCACTCTTATCTACAT 


1073 


CDK4 


NM 000075 


456426 


1019 


D-003238-06 


GGAGGAGGCCTTCCCATCA 


1074 


CDK4 


Kin A A#%/\A'7C? 

NM 000075 


A IT f> A or> 

456426 


1019 


D-003238-07 


TCGAAAGCCTCTCTTCTGT 


1075 


CDK4 


NM 000075 


456426 


1019 


D-003238-08 


^^^^ A ^^^^^^ Jft ^^^^^^ .^^^^ Jfe jh ^■^d^MM* 

GTACCGAGCTCCGGAAGTT 


1076 


CDK5 














CDK5 


NM 004935 


4826674 


1020 


D-003239-05 


TGACCAAGCTGCCAGACTA 


1077 


CDK5 


NM 004935 


4826674 


1020 


D-003239-06 


GAGCTGAAATTGGCTGATT 


1078 


CDK5 


NM 004935 


4826674 


1020 


D-003239-07 


CAACATCCCTGGTGAACGT 


1079 


CDK5 


NM 004935 


4826674 


1020 


D-003239-08 


GGATTCCCGTCGGCTGTTA 


1080 


CDK6 














CDK6 


NM 001259 


16950658 


1021 


D-003240-05 


GCAAAGACCTACTTCTGAA 


1081 


CDK6 


NM 001259 


16950658 


1021 


D"003240-06 


GAAGAAGACTGGCCTAGAG 


1082 


CDK6 


Kilt A e\f\A^FfX 

NM 001259 


16950658 


1021 


D-003240-07 


GGTCTGGAGTTTCTTCATr 


1083 


CDK6 


K IR A r\t\A 

NM 001259 


16950658 


1021 


D-003240-08 


TAACAGATATCGATGAACT 


1084 


CDK7 














CDK7 


NM 001799 


16950659 


1022 


D-003241-05 


A A JK A A A A ■■■■■■ JK 

GGACATAGATCAGAAGCTA 


1085 


CDK7 


NM 001799 


16950659 


1022 


D-003241-06 


CAATAGAGCTTATAGACAT 


1086 


CDK7 


NM 001799 


16950659 


1022 


D-003241-07 


CATACAAGGCTTATTCTTA 


1087 


CDK7 


NM 001799 


16950659 


1022 


D-003241-08 


GGAGACGACTTACTAGATC 


1088 


CDK8 














CDK8 


NM 001260 


4502744 


1024 


D-003242-05 


CCACAGTACTCACATCAGA 


1089 


CDK8 


NM 001260 


4502744 


1024 


D-003242-06 


GCAATAACCACAGTAATGG 


1090 


CDK8 


NM 001260 


4502744 


1024 


D-003242-07 


GAAGAAAGTGAGAGTTGTT 


1091 


CDK8 


NM 001260 


4502744 


1024 


D-003242-08 


GAACATGACGTCTGGCATA 


1092 


CDK9 














CDK9 


NM 001261 


17017983 


1025 


D-003243-05 


GGCCAAACGTGGACAACTA 


1093 


CDK9 


NM 001261 


17017983 


1025 


D-003243-06 


TGACGTCCATGTTCGAGTA 


1094 


CDK9 


NM 001261 


17017983 


1025 


D-003243-07 


UC;AAUUAC3AUt3C3AC3 1 I 1 (3A 


1095 


CDK9 


NM 001261 


17017983 


1025 


D-003243-08 


GAAGGTGGCTCTGAAGAAG 


1096 


CDKN1C 














CDKN1C 


NM 000076 


4557440 


1028 - 


D-0.03244-05 


GACCAGAACCGCTGGGATT 


1097 


CDKN1C 


NM 000076 


4557440 


1028 


D-003244-06 


GGACCGAAGTGGACAGCGA 


1098 


CDKN1C 


NM 000076 


4557440 


1028 


D-003244-08 


GCAAGAGATCAGC6CCTGA 


1099 


CDKN1C 


Bk IB A ^^r\^~»^ 

NM 000076 


4557440 


1028 


D-003244-09 


CCGCTGGGATTACGACTTG 


1100 


CDKN2B 














CDKN2B 


NM 004936 


17981693 


1030 


D-003245-05 


GCGAGGAGAACAAGGGCAT 


1101 


CDKN2B 


NM 004936 


17981693 


1030 


D-003245-06 


UUAAUtiUAU 1 UAAUUIJ 1 1 1 


1102 


CDKN2B 


NM 004936 


17981693 


1030 


D-003245-07 


CGATCCAGGTCATGATGAT 


1103 


CDKN2B 


NM 004936 


17981693 


1030 


D-003245-08 


CCTGGAAGCCGGCGGGGAT 


1104 


CDKN2C 














CDKN2C 


NM 001262 


17981697 


1031 


D-003246-05 


GGACACCGCCTGTGATTTG 


1105 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-06 


GCCAGGAGACTGCTACTTA 


1106 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-07 


TGAAAGACCGAACTGGTTT 


1107 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-08 


GAACCTGCCCTTGCACTTG 


1108 


CDKN2D 














CDKN2D 


NM 001800 


17981700 


1032 


D-003247-05 


TGGCAGTTCAAGAGGGTCA 


1109 


CDKN2D 


KIRS nn^A^A 

NM 001800 


17981700 


1032 


D-003247-06 


CTCAGGAGCTCGTGGACAT 


1110 




NIvl UUioUU 


1 / 981700 


1032 


D-003247-07 


1 GAAGGTCCTAGTGGAGCA 


A A A A 

1111 


CDKN2D 


NM 001800 


17981700 


1032 


D-003247-08 


AGACGGCGCTGCAGGTCAT 


1112 



119 



CDT1 
CDTl 
CDT1 
CDT1 
CDT1 

CENPA 


NM 030928 
NM 030928 
NM_030928 
NM 030928 

NM 001809 


19923847 
19923847 
19923847 
19923847 

4585861 


81620 
81620 
81620 
81620 

1058 


13-003248-05 
D-003248-06 
D-003248-07 
D-003248-08 

D-003249-05 


CCAAGGAGGCACAGAARHA 
GCTTCAACGTGGATGAAGT 
TCTCCGGGCCAGAAGATAA 
GCGCAATGTTGGCCAGATC 

GCAPAPAPPTPTTn ax a Ar* 


1113 
1114 
1115 
1116 

1117 


CENPA 

PPMPA 


NM 001809 

rim UUioUy 


4585861 

4585861 


1058 

1058 


D-003249-06 


GCAAGAGAAATATGTGTTA 


1118 


PPMDA 


Nivl 001 809 


4585861 


1058 


D-003249-07 
D-003249-08 


TTACATGCAGGCCGAGTTA 
GAGACAAGGTTGGCTAAAG 


1119 
1120 


PPMDR 
















NM 001810 


26105977 


1059 


D-003250-05 


GGACATAGCCGCCTGCTTT 


1121 


PPMPR 


Nm UOI0IO 


26105977 


1059 


D-003250-06 


GCACGATCCTGAAGAACAA 


1122 


CENPB 


NM 001810 


26105977 


1059 


D-003250-07 


GGAGGAGGGTGATGTTGAT 


1123 


PPM DP 

CENPC1 


NM 001810 


26105977 


1059 


D-003250-08 


CCGAATGGCTGCAGAGTCT 


1124 




NM L)Ulo12 
NM 001 81 2 


4502778 
4502778 


1060 
1060 


D-003251-05 
D-003251-06 


GCGAATAGATTATCAAGGA 
GAACAGAATCCATCACAAA 


1125 
1126 


CENPC1 


NM .001812 


4502778 


1060 


D-003251-07 


CCATAAACCTCACCCAGTA 


1127 


CENPC1 
CENPE 
CENPE 
CENPE 

CENPE 


NM 001812 

NM 001813 
NM 001813 

NM 001813 
NM 001813 


4502778 

4502780 
4502780 
4502780 
4502780 


1060 

1062 
1062 
1062 
1062 


D-003251-08 

D-003252-05 
D-003252-06 
D-003252-07 
D-003252-08 


CAAGAGAACACGTTTGAAA 

GAAGACAGCTCAAATAATA 
CAACAAAGCTACTAAATCA 
GGAAAGAAGTGCTACCATA 
GGAAAGAAATGACACAGTT 


1128 

1129 
1130 
1131 
1132 


CENPF 
CENPF 
CENPF 
CENPF 

CENPH 


NM 016343 
NM 016343 
NM 016343 
NM_ 016343 


14670380 
14670380 
14670380 
14670380 


1063 
1063 
1063 
1063 


D-G03253-05 
D-003253-06 
D-003253-07 
D-003253-08 


GCGAATATCTGAATTAGAA 
GGAAATTAATGCATCCTTA 
GAGCGAGGCTGGTGGTTTA 
CAAGTCATCTTTCATCTAA 


1133 
1134 
1135 
1136 


CENPH 
CENPH 


NM 022909 
NM 022909 


21264590 
21264590 


64946 
64946 


D-003254-05 


GAAAGAAGAGATTGCAATT 


1137 


CENPH 

PPMDW 


NM 022909 
NM 022909 


21264590 
21264590 


64946 
64946 


D-003254-06 
D-003254-07 


CTAGTGTGCTCATGGATAA 


1138 
1139 


PWPWi 








D-003254-08 


GAAACACCTATTAGAGCTA 


1140 




vim 001274 


20127419 


1111 


D-003255-05 


CAAATTGGATGCAGACAAA 


1141 


CHEK1 


NM 001274 


20127419 


1111 


D-003255-06 


GCAACAGTATTTCGGTATA 


1142 


CHEK1 


NM 001274 
NM 001274 


20127419 
20127419 


1111 
1111 


D-00i3255^7 
D^03255i08 


GGACTTCTCTCCAGTAAAC 
AAAGATAGATGGTACAACA 


1143 
1144 


CHEK2 
CHEK2 
CHEK2 


NM 007194 
NM 007194 


22209010 
22209010 


11200 
11200 


D-003256-02 


CTCTTACATTGCATACATA 


1145 


CHEK2 
CHEK2 
CNK 


NM 007194 
NM 007194 


22209010 
22209010 


11200 
11200 


D-003256-03 
D-003256-04 
D-003256-05 


TAAACGCCTGAAAGAAGCT 
GAAATTGCACTGTCACTAA 


1146 
147 
1148 


CNK 
CNK 


NM 004073 
NM 004073 

NM 004073 


4758015 
4758015 
475801 5 


1263 
1263 
1263 


D-003257-O5 
D-003257-07 


f^nri Ar^A Ari ATr*r^T A A AT"^ A 
GCAAGTGGGTTGACTACTC 


1149 
1150 


PMk' 


NM 004073 


4758015 


1263 


D-003257-08 
D-003257-09 


GCACATCCGTTGGCCATCA 
GACCTCAAGTTGGGAAATT 


1151 
1152 


PRI1 














PRM 


NM 014335 


7656937 


23741 


D-003258-05 


GTGATGAGATTATTGATAG 


1153 


PRH 


NM 014335 


7656937 


23741 


D-003258-06 


GGACGAGGGCGAGGAATTT 


1154 


PRM 


NM Ul4o3o 


7656937 


23741 


D-003258-07 


GGAAACGGAGCCTTGCTAA 


1155 


CRI1 


NM 014335 


7656937 


23741 


D-003258-08 


TCAATCGTCTGACCRAARA 


1156 


E2F1 








1 





E2F1 


NM 005225 


12669910 


18RQ 


u-uuOidoy-'UD 


OAACAGGGCCACTGACTCT 


1157 


E2F1 


NM 005225 


12669910 


18RQ 


u-uuo^oy-uo 


TGGACCACCTGATGAATAT 


1158 


E2F1 


NM 005225 


12669910 


IftRQ 


n nn'^OKQ n7 


CrOCAGGAGGTCACTTCTGA 


1159 


E2F1 


NM 005225 


1266QQin 




U-UUoZoy-Uo 


GGCTGGACCTGGAAACTGA 


1160 


E2F2 














E2F2 


NM 004091 




1R70 


U-UUOiiOU-Oo 


GGGAGAAGACTCGGTATGA 


1161 


E2F2 


NM 004091 


3448S718 


1ft70 
1 Of u 


n nnooftA ac 


GAGGACAACCTGCAGATAT 


1162 


E2F2 


NM 004091 


34485718 


1870 


H-OO'^Oftn r^7 


1 GAAGGAGCTGATGAACAC 


1163 


E2F2 


NM 004091 


34485718 


1ft70 


r> nnQocA aq 


CCAAGAAGTTCATTTACCT 


1164 


E2F3 














E2F3 


NM 001949 


1266QQ1'^ 


1ft71 
lOr 1 


r> nnooft'i ac 
U-UUOiiD i-Oo 


GAAATTAGATGAACTGATC 


1165 


E2F3 


NM 001949 




1ft71 
lOf 1 


p^ nAooci4 AC 
U-UUozbl-Ub 


TGAAGTGCCTGACTCAATA 


1166 


E2F3 


NM 001949 


IPfifiQQI*^ 


1ft71 
■ Of 1 


nnooc'i at 
u-UUoZdI-U/ 


GAACAAGGCAGCAGAAGTG 


1167 


E2F3 


NM 001949 


1 26RQQ1 ^ 


1A71 
lOf 1 


r\ AAQOCSH AO 


GAAACACACAGTCCAATGA 


1168 


E2F4 














E2F4 


NM 001950 




lOi H 


P\ AA*aO£50 AC 


GGAGATTGCTGACAAACTG 


1169 


E2F4 


NM 001950 






AAQO^iO AC 


GAAGGTATCGGGCTAATCG 


1170 


E2F4 


NM 001950 




1R7A 


n AAOOCO AT 


GTGCAGAAGTCCAGGGAAT 


1171 


E2F4 


NM 001950 


1266QQ14 


iOi H 


n AAQOCO AO 


GGACAGTGGTGA6CTCAGT 


1172 


E2F5 














E2F5 


NM 001951 




lo/ 0 


Pi AAOOCO AC 

U-UUOiibo-Oo 


GCAGATGACTACAACTTTA 


1173 


E2F5 


NM 0019^)1 




ift7c; 


U-0032d3-06 


GACATCAGCTACAGATATA 


1174 


E2F5 


NM 001951 


12fiRQQ1R 


Ift7c; 

1 0/ o 


n fiAootfso A"7 


CAACATGTCTCTGAAAGAA 


1175 


E2F5 


NM 001951 




lo/ D 


r\ AAOOCO AO 

U-UUOiloo-Oo 


GAAGACATCTGTAATTGGT 


1176 


E2F6 














E2F6 


NM 001952 


1 2R6QQ1 7 


1 Of O 


Pi AAQOOI AC 


TAAACAAGG7TGCAACGAA 


1177 


E2F6 


NM 001 QSP 


1 9RRQQi 7 


lO/O 


!J-uu32d4-06 


TAGCATATGTGACCTATCA 


1178 


E2F6 


NM 001952 




iO/O 


r\ AAOOOy! A"7 

U-Uuo2d4-07 


GAAACCAGATTGGATGTTC 


1179 


E2F6 


NM 001 QS? 




10/ D 


U-U0o2d4-09 


GGAACTTTCTGACTTATCA 


1180 


FOS 














FOS 


NM 005252 






r\ AAOOCC AC 

U-UUo2bo-U5 


GGGATAGCCTCTCTTACTA 


1181 


FOS 


NM 005252 






U-UUoZo5-0d 


GAACAGTTATCTCCAGAAG 


1182 


FOS 


NM 005252 






r\ AAOOCC A^' 


GGAGACAGACCAACTAGAA 


1183 


FOS 


NM 005252 






r\_AAOO£5C AO 


AGACCGAGCCGTTTGATGA 


1184 


H1PK2 














HIPK2 


NM 022740 






U-Uuo2do-0d 


GAGAATCACTCCAATCGAA 


1185 


HIPK2 


NM 022740 






u-UUo2bD-07 


AGACAGGGATTAAGTCAAA 


1186 


HIPK2 


NM 022740 






U-UUoZdd=Qo 


GGACAAAGACAACTAGGTT 


1187 


mPK2 


NM 022740 




OOQQR 


r% AAOOCC r\'f\ 


GCACACACGTCAAATCATG 


1188 


HUS1 














HUS1 


NM 004507 


'^l 07791 


OOD'I" 


U-Uuo2b/-05 


ACAAAGGGCTTATGCAATA 


1189 


HUS1 


NM 004507 


'^l 07791 




U-UUozb/-uD 


GAAGTGCACATAGATATTA 


1190 


HUS1 


NM 004507 

i ^IVI \J\J^\J\J I 


'^l 07791 




U-UUo2b r-07 


AAGCTTAACTTCATCCTTT 


1191 


HUS1 


NM 004507 


3107791*^ 


Ov>D*t 


~1 AAQOCT AO 

L/-UU02b # -Uo 


GAACTTCTTCAACGAATTT 


1192 


JUN 














JUN 


NM 002228 


7710199 




AAOOCO AC 

U-UUoZb<5-Uo 


TGGAAACGACCTTCTATGA 


1193 


JUN 


NM 002228 


7710199 


Or AO 


nn^oi^o AC 
U-UUo^Do-Ub 


GAACTGCACAGCCAGAACA 


1194 


JUN 


NM 002228 


/ r 1 U I 


Or 


AAQOCO A^ 

j-UUoZoo-07 


GAGCTGGAGCGCCTGATAA 


1195 


JUN 


NM 002228 


7710199 

f r IvF 


Of 


"^ AA^OAD AO 


TAACGCAGCAGTTGCAAAC 


1196 


JUNB 














JUNB 


VJM 00222Q 

1 >f ivi \J\J c,e. C-.\J 


4^0AAn« 

*TiJU*TOUO 


^^70^ 
O/ilD 


"\ AAOOCA AC 

U-Uuo2by~0o 


GCATCAAAGTGGAGCGCAA 


1197 


JUNB 


NM 002229 


4S04808 


Of 


"^-AAQOCA AC 

u-uuo^by-Ub 


TGGAAGACCAAGAGCGCAT 


1198 


JUNB 


NM 002229 




Of 


"^ AAOOCA A*7 

u-uuo2by-u / 


CATACACAGCTACGGGATA 


1199 


JUNB 


NM 002229 


4504808 


3726 


D-003269-08 


CCATCAACATGGAAGACCA 


1200 


LOC510 
53 















121 



LULrOlO 

53 


NM_015895 


20127542 


51053 


D-003270-05 


GGAGAAAGGCGCTGTATGA 


1201 


L0051 0 
53 


NM_01 5895 


20127542 


51053 


D-003270-06 


GAATAGTTCTGTCCCAAGA 


1202 


LUOol 0 
53 


NM_015895 


20127542 


51053 


D-003270-07 


GAACATGTACAGTATATGG 


1203 


LOC510 

53 

IVIMUZLI 


NM_015895 


20127542 


51053 


D-00327n-nft 


V3i^A\jjAAAOAAvjAAGAAATC 


1204 




IVjM 002358 


6466452 


4085 


D-003271-05 


GAAAGATGGCAGTTTGATA 


1205 




NM 002358 


6466452 


4085 


D-003271-06 


TAAATAATGTGGTGGAACA 


1206 


IVlAUZLl 


NM 002358 


6466452 


4085 


D-003271-07 


GAAATCCGTTCAGTGATCA 


1207 


MA not i 


NM 002358 


6466452 


4085 


D-003271-08 


TTACTCGAGTGCAGAAATA 


1208 


RiiAnoi o 














MAnoi o 


v\M 006341 


6006019 


10459 


D-003272-05 


GGAAGAGCGCGCTCATAAA 


1209 


ft/iAnoi o 


Nivl 006341 


6006019 


10459 


D-003272-06 


TGGAAGAGCGCGCTCATAA 


1210 


h/iAnoi o 


InM 00oo41 


6006019 


10459 


D-003272-07 


AGCCACTCCTGGAGAAGAA 


1211 


A/!Ar\OI o 


NM 006341 


6006019 


10459 


D-003272-08 


TGGAGAAATTCGTCTTTGA 


1212 


















NM 004526 


33356546 


4171 ' 


U-003273-05 


GAAGATCTTTGCCAGCATT 


1213 




NM 004526 


33356546 


4171 


D-003273-06 


GGATAAGGCTCGTCAGATC 


1214 




NM 004526 


33356546 


4171 


D-003273-07 


CAGAGCAGGTGACATATCA 


1215 




NM 004526 


33356546 


4171 


D-003273-08 


GCCGTGGGCTCCTGTATGA 


1216 


IVIOIVIo 














iVIOIVio 


NM 002388 


33356548 


4172 


D-003274-05 


GGACATCAATATTCTTCTA 


1217 


IVllMr mo 


NM 002388 


33356548 


4172 


D-003274-06 


GCCAGGACATCTCCAGTTA 


1218 


MOIVIo 


NM 002388 


33356548 


4172 


D-003274-07 


GCAGGTATGACCAGTATAA 


1219 


IvlOlvIo 


NM 002388 


33356548 


4172 


D-003274-08 


GGAAATGCCTCAAGTACAC 


1220 


ivlOlvi4 














ML#Ivl4 


XM 030274 


22047061 


4173 


D-003275-05 


GGACATATCTATTCTTACT 


1221 


mOIVI4 


AM 030274 


22047061 


4173 


D-003275-06 


GATGTTAGTTCACCACTGA 


1222 


lvlUlvI4 


XM 030274 


22047061 


4173 


D-003275-07 


CCAGCTGCCTCATACTTTA 


1223 


IViL»IVi4 


AM 030274 


22047061 


4173 


D-003275-08 


GAAAGTACAAGATCGGTAT 


1224 


iVlOlvtD 














IVlLrfViD 


NM 006739 


23510447 


4174 


D-003276-05 


GAAGATCCCTGGCATCATC 


1225 


IvIOIVlO 


NM 006739 


23510447 


4174 


D-003276-06 


GAACAGGGTTACCATCATG 


1226 


ivIOIVID 


NM 006739 


23510447 


4174 


D-003276-07 


GGACAACATTGACTTCATG 


1227 


ivIOlvID 


NM 006739 


23510447 


4174 


D-003276-08 


CCAAGGAGGTAGCTGATGA 


1228 


IViV^lvlD 








- 






ivioivio 


NM 005915 


33469920 


4175 


D-003277-05 


GGAAAGAGCTCAGAGATGA 


1229 




NM 005915 


33469920 


4175 


D-003277-06 


GAGCAGCGATGGAGAAATT 


1230 




NM 005915 


33469920 


4175 


D-003277-07 


GGAAACACCTGATGTCAAT 


1231 


vIOIViD 


NM 005915 


33469920 


4175 


D-003277-08 


CCAAACATCTGCCGAAATC 


1232 


















NM 005916 


33469967 


4176 


D-003278-05 


GGA/\ATATCCCTCGTAGTA 


1233 




NM 005916 


33469967 


4176 


D-003278-06 


GGAAGAAGCAGTTCAAGTA 


1234 


IVIV^Ivl f 


NM 005916 


33469967 


4176 


D-003278-07 


CAACAAGCCTCGTGTGATC 


1235 


ivlOIVI/ 


NM 005916 


33469967 


4176 


D-003278-08 


GGAGAGAACAGAAGGATTG 


1236 


















NM 002392 


45051 36 


4193 


D-003279-05 


GGAGATATGTTGTGAAAGA 


1237 


R/iniv/io 

VlUtvIZ 


NM 002392 


45051 36 


4193 


D-003279-06 


CCACAAATCTGATAGTATT 


1238 




NM 002392 


4505136 


4193 


D-003279-07 


GATGAGGTATATCAAGTTA 


1239 


IVJUIvl^ 


NM 002392 


45051 36 


4193 


0-003279-08 


GGAAGAAACCCAAGACAAA 


1240 


IvIIXIOr 
















NM 002417 


19923216 


4288 


D-003280-05 


GCACAAA6CTTGGTTATAA 


1241 


MKI67 


NM 002417 


19923216 


4288 




L»o 1 AflijAUU 1 oAACTATTT 


1242 


MKI67 


NM 002417 


19923216 


4288 


D-003280-07 


CAAAGAGGAACACAAATTA 


1243 



11 



MKI67 

MNAT1 

MNAT1 

MNAT1 

MNAT1 

MNAT1 

MYC 

MYC 

MYC 

MYC 

MYC 

0RC1L 


NM 002417 

NM 002431 
NM 002431 
NM 002431 
NM 002431 

NM. 002467 
NM 002467 
NM 002467 
NM 002467 


19923216 

4505224 
4505224 
4505224 
4505224 

31543215 
31543215 
31543215 
31 54321 «> 


4288 

4331 
4331 
4331 
4331 

4609 
4609 
4609 


D-003280-Oa 

D-003281-05 
D-003281-06 
D-003281-07 
D-003281-08 

D-003282-05 
D-003282-06 
D-003282-07 


. GTAAATGGGTCTGTTATTG 

GGAAGAAGCTTTAGAAGTR 
TAGATGAGCTGGAGAGTTC 
GGACCTTGCTGGAGGCTAT 
GCAGATAGAGACATATGGA 

CAGAGAAGCTGGCCTCCTA 
GAAACGACGAGAACAGTTG 
CGACGAGACCTTCATCAAA 
UCACACATCAGCACAACTA 


1244 

1245 
1246 
1247 
1248 

1249 
1250 
1251 
1252 


ORC1L 

ORC1L 

ORC1L 

ORC1L 

0RC2L 

0RC2L 

ORC2L 

ORC2L 

ORC2L 

ORC3L 

ORC3L 

ORC3L 

0RC3L 

0RC3L 

0RC4L 

0RC4L 

ORC4L 

ORC4L 

ORC4L 

ORC5L 

ORC5L 

0RC5L 

ORC5L 

0RC5L 

0RC6L 

0RC6L- 

0RC6L 

0RC6L 

ORC6L 

PCNA 

PCNA 

PCNA 

PCNA 

PCNA 

PLK 

PLK 

PLK 

PLK 

PLK 

PIN1 

PIN1 

PIN1 

PIN1 

PIN1 


NM 004153 
NM 004153 
NM 004153 
NM 004153 

NM 006190 
NM 006190 
NM 006190 
NM 006190 

NM 012381 
NM 012381 
NM 012381 
NM 012381 

NM 002552 
NM 002552 
NM 002552 
NM 002552 

NM 3002553 
NM 002553 
NM 002553 
NM 002553 

NM 014321 
NM 014321 
NM 014321 
NM 014321 

NM 002592 
NM 002592 
NM 002592 
NM 002592 

NM 005030 
NM 005030 
NM 005030 
NM 005030 

NM 006221 
NM 006221 
NM 006221 
NM_006221 


31795543 
31795543 
31795543 
31795543 

32454751 
32454751 
32454751 
32454751 

32483366 
32483366 
32483366 
32483366 

32454749 
32454749 
32454749 
32454749 

32454752 
32454752 
32454752 
32454752 

32454755 
32454755 
32454755 
32454755 

33239449 
33239449 
33239449 
33239449 

21359872 
21359872 
21359872 
21359872 

5453897 
5453897 
5453897 
5453897 , 


4998 
4998 
4998 
4998 

4999 
4999 
4999 
4999 

23595 
23595 
23595 
23595 

5000 
5000 
5000 
5000 

5001 
5001 
5001 
5001 

23594 
23594 
23594 
23594 

5111 
5111 
5111 
5111 

5347 
5347 
5347 
5347 

5300 
5300 
5300 
5300 


D-003283-05 
D-003283-06 
D-003283-07 
D-003283-08 

D-003284-05 
D-003284-06 
D-003284-07 
D-003284-08 

D-003285-05 
D-003285-06 
D-003285-07 
D-003285-08 

D-003286-06 
D-003286-07 
D-003286-08 
D-003286-09 

D-003287-05 
D-003287-06 
D-003287-07 
D-003287-08 

D-003288-05 
D-003288-06 
D-003288-07 
D-003288-08 

D-003289-05 
D-003289-07 
D-003289-09 
D-003289-10 

D-003290-05 
D-003290-06 
D-003290-07 
D-003290-08 

D-003291-05 
D-003291-06 
D-003291-07 
D-003291-08 i 


GAACAGGAATTCCAAGACA 
TAAGAAACGTGCTCGAGTA 
GAGATCACCTCACCTTCTA 
GCAGAGAGCCCTTCTTGGA 

GAAGAAACCTCCTATGAGA 
GAAGGGAACTGATG6AGTA 
GAAGAATGAT.CCTGAGATT 
GAAGAGATGTTCAAGAATC 

GGACTGCTGTGTAGATATA 
GAACTGATGACCATACTTG 
AAAGATCTCTCTGCCAATA 
CAGCACAGCTAAGAGAATA 

GAAAGCACATTCCGTTTAT 
TGAAAGAACTCATGGAAAT 
GCTGAGAAGTGGAATGAAA 
CCAGTGATCTTCATATTAG 

GAAATAACCTGTGAAACAT 
CAGATTACCTCTCTAGTGA 
GAACTTCCATATTACTCTA 
GTATTCAGCTGATTTCTAT 

GAACATGGCTTCAAAGATA 
GGACAGGGCTTATTTAATT 
GAAAGAAGATAGTGGTTGA 
TATCAGAGCTGTCTTAAAT 

GATCGAGGATGAAGAAGGA 
GCCGAGATCTCAGCCATAT 
GAGGCCTGCTGGGATATTA 
GTGGAGAACTTGGAAATGG 

CAACCAAAGTCGAATATGA 
CAAGAAGAATGAATACAGT 
GAAGATGTCCATGGAAATA 
CAACACGCCTCATCCTCTA 

GGACCAAGGAGGAGGCCCT 
CGTCCTGGCGGCAGGAGAA 
CGGGAGA6GAGGACTTTGA 
^GTCGGGAGAGGAGGACTT 


1253 
1254 
1255 
1256 

1257 
1258 
1259 
1260 

1261 
1262 
1263 
1264 

1265 
1266 
1267 
1268 

1269 
1270 
1271 
1272 

1273 
1274 
1275 
1276 

1277 
1278 
1279 
1280 

1281 
1282 
1283 
1284 

1285 
1286 

1288 



PIN1L 



PIN1L 
PIN1L 



NM 006222 



PIN1L 
PIN1L 



RAD1 



RAD1 



RAD1 



RAD1 



RAD1 



RAD17 



RAD17 



RAD17 



RAD17 



RAD17 



RAD9A 



RAD9A 



RAD9A 



RAD9A 



RAD9A 



RB1 



RB1 



RB1 



RBI 



RB1 



RBBP2 



NM 006222 



NM 006222 



NM 006222 



NM 002853 19718797 



NM 002853 



5453899 



5453899 



5453899 



5453899 



NM 002853 



NM 002853 



NM 133338 



NM 133338 



NM 133338 



NM 133338 



NM 004584 



NM 004584 



NM 004584 



NM 004584 



19718797 



19718797 



19718797 



19718783 



19718783 
19718783 



19718783 



19924112 



19924112 



19924112 



19924112 



NM 000321 



NM 000321 



NM 000321 



NM 000321 



19924112 



19924112 



4506434 



4506434 



5301 



5301 



5301 



5301 



5810 
5810 



5810 



5810 



5884 



5884 



5884 



5884 



5883 



5883 



5883 



5883 



5925 



5925 



5925 



5925 



D-003292-06 



D-003292-07 
D-003292-08 



D-003292-09 



D-003293-05 



D-003293-06 



D-003293-07 



D-003293-08 



D-003294-05 



D-003294-06 



D-003294-07 



D-003294-08 



D-003295-05 



D-003295-06 



D-003295-08 



D-003295-09 



D-003296-05 



D-003296-06 



D-003296-07 



D-003296-08 



CGACCTGGCGGCARPiAAAT 



AGGCAGGAGAGAAGGACTT 



GCTACATCCAGAAGATCAA 
GGACAGTCTTCACGGATTC 



^AAGATGGACAAATATGTT 



GGAAGAGTCTGTTAr.TTr 



GATAACAGAGGCTrnn-m- 



GCATTAGTCCTATm-rriTA 



GAATCAAGCTTCCATATnT 



CAACAAAGCCCGAGGATAT 



ACACATGCCTGGAGACTTA 



CTACATAGATTTCTTnATC 



TCAGCAAACTTGAATCTTA 



GACATTGACTCTTAnATriA 



GGAAACCACTATAGnCAAT 



CGGACGACTTTGCCAATfiA 



GAAAGGACATGTGAACTTA 



GAAGAAGTATGATGTATTG 



GAAATGACTTCTAr.Tnr:;AA 



GGAGGGAACATCTATATTT laoft 



1289 



1290 



1291 
1292 



1293 



1294 



1295 



1296 



1297 



1298 



1299 



1300 



1301 



1302 



1303 
1304 



1305 



1306 



1307 



RBBP2 



NM 005056 



4826967 



5927 



D-003297-05 



CAAAGAAGCTGAATAAAr.T 



1309 
1310 



NM 005056 



4826967 



RBBP2 



5927 



NM 005056 



RBBP2 



4826967 



D-003297-06 



NM 005056 



5927 



4826967 



D-003297-07 



CAACACATATGGCGGATTT 



RBL1 



5927 



D-003297-08 



GGACAAACCTAGAAAPtAAfi 



RBL1 



RBL1 



NM 002895 34577078 



NM 002895 



5933 



RBL1 



NM 002895 



RBL1 



NM 002895 



RBL2 



RBL2 



NM 005611 



34577078 



D-003298-05 



34577078 



5933 
5933 



D-003298-06 



D-003298-07 



34577078 5933 



21361291 



D-003298-08 



5934 



D-003299-05 



GAAAGGCACTCTCTCTGTT 



1311 
1312 



CAAGAGAAGTTGTGGCATA 
CAGCAGCACTCCATTTATA 



1313 



ACAGAAAGGTCTATCATTT 



1314 



GGACATAAAGTTACAATTn 



1315 



1316 



GAGCAGAGCTTAATCriAAT 



NM 005611 



RBL2 



21361291 



NM 005611 



RBL2 



21361291 



5934 
5934 



D-003299-06 



NM 005611 



RBP1 
RBP1 



21361291 



D-003299-07 



GAGAATAGCCCTTGTGTGA 



5934 



D-003299-08 



GGACTTAGTTTATGGAAAT 
GAATTTAGATGAGCGGATA 



NM 002899 



8400726 



RBP1 



5947 



NM 002899 



RBP1 



8400726 



D-003300-05 



NM 002899 



5947 



RBP1 



8400726 



D-003300-06 



GAGACAAGCTCCAGTGTGT 



5947 



NM 002899 



RPA3 



8400726 



D-003300-07 



GCAAGCAAGTATTCAAGAA 



5947 



RPA3 



RPA3 



RPA3 



RPA3 



SKP1A 



SKP1A 



SKP1A 



SKP1A 



SKP1A 
SKP2 



D-003300-08 



GCAGGACGGTGACCATATG 



GCAAGTGCATGACAACAGT 



NM 002947 



NM 002947 



NM 002947 



NM 002947 



NM 006930 



NM 006930 



NM 006930 
NM_006930 



19923751 



6119 



19923751 



D-003322-05 



6119 



19923751 



1-003322-06 



GGAAGTGGTTGGAAGAGTA 



6119 



19923751 



-003322-07 



GAAGATAGCCATCCTTTTG 



6119 



D-003322-08 



CATGCTAGCTCAATTCATC 



GATCTTGGACnTACAATG 



25777710 



25777710 



6500 D-003323-05 



25777710 



6500 D-003323-06 



GGAGAGATATTTGAAGTTG 



6500 



25777710 



6500 



D-003323-07 



GGGAATGGATGATGAAGGA 



D-003323-08 



CAAACAATCTGTGACTATT 



TCAATTAAGTTGCAGARTT 



1318 



1319 



1320 



1321 



1322 



1323 



1324 



1325 
1326 



1327 



1328 



1329 



1330 



1331 



1332 



SKP2 



SKP2 



SKP2 



SKP2 



SNK 



SNK 



SNK 
SNK 
STK12 



STK12 
STK12 
STK12 
STK12 
TFDP1 



TFDP1 



TFDP1 
TFDP1 



TFDP1 
TFDP2 



TFDP2 



TFDP2 



TFDP2 



TFDP2 



TP53 



TP53 



TP53 
TP53 
TP53 



TP63 
TP63 



TP63 



TP63 
TP63 
TP73 
TP73 



TP73 



TP73 



TP73 



YWHAZ 



NM 005983116306594 



NM 005983 



NM 005983 



NM 005983 



NM 006622 



NM 006622 



NM 006622 
NM. 006622 



NM 004217 
NM 004217 
NM 004217 
NM 004217 



NM 00711 



YWHAZ 



YWHAZ 
YWHAZ 
YWHAZ 

TaMeK 



NM_00711 1 
NM 00711 



NM 007111 



NM 006286 



NM 006286 



NM 006286 



NM 006286 



NM 000546 



NM 000546 
NM_ 000546 



NM 000546 



NM 003722 



NM_ 003722 



NM 003722- 
NM_003722 

NM 005427 



NM 005427 



NM 005427 



NM 005427 



NM_ 003406 



NM 003406 
VIM 003406 
NM 003406 



16306594 
16306594 



16306594 



5730054 



5730054 
5730054 



5730054 



4759177 
4759177 
4759177 
4759177 



34147667 



34147667 
34147667 



34147667 



5454111 



5454111 



5454111 



5454111 



6502 



6502 
6502 



6502 



10769 



10769 



10769 
10769 



9212 
9212 
9212 
9212 



7027 
7027 
7027 



7027 



7029 



7029 



7029 



7029 



8400737 



8400737 
8400737 



8400737 



31543817 



7157 



7157 
7157 
7157 



31543817 



31543817 
31543817 

4885644 



4885644 



4885644 



4885644 



21735623 



21735623 
21735623 
21735623 



8626 



8626 



8626 
8626 

7161 



7161 
7161 



7161 



7534 



7534 
7534 
7534 



D-003324-05 



D-003324-06 



D-003324-07 



D-003324-08 



D-003325-05 



D-003325-06 



D-003325-07 
D-003325-08 



D-003326-07 
D-003326-08 
D-003326-09 
D-003326-10 



D-003327-05 



D-003327-06 
D-003327-07 



D-003327-08 



D-003328-05 



P-003328-06 



D-003328-07 



D-003328-08 



D-003329-05 



D-003329-06 
D-003329-07 



D-003329-08 



D-003330-05 



CATCTAGACTTAAfiTPiATA 



GAAATCAGATCTCTCTACT 



CTAAAGGTCTCTGRTfiTTT 



GATGGT^CCCTTCAACTGT 



GAAGACATCTACAAGCTTA 
GAAATACCTTCATGAACAA 



GAAGGTCAATGGCTCATAT 
CCGGAGATCTCGnftrtATTA 



CAGAAGAGCTGCACATTTG 
CCAAACTGCTCAGGCATAA 
ACGCGGCACTTCACAATTG 
TGGGACACCCGACATmTA 



GGAAGCAGCTCTTGCCAAA 



GAGGAGACTTGAAAGAATA 
GAACTTAGAGGTGGAAAp;a 



GCGAGAAGGTGCAGAGGAA 



GAAAGTGTGTGAGAAAGTT 
CACAGGACCTTCTTfiP^TTA 



1333 
1334 



1335 



1336 



1337 



1338 



1339 
1340 



1341 
1342 
1343 
1344 



1345 



1346 
1347 



CGAAATCCCTGGTGCCAAA 



TGAGATCCATGATGACATA 



GAGGTTGGCTCTGAnTriTA 



CAGTCTACCTCCCGCCATA 
GCACAGAGGAAGAGAATCT 



GAAGAAACCACTGGATGGA 



D-003330-06 



D-003330-07 
D-003330-08 



0-003331-05 



D-003331-06 



D-003331-07 



D-003331-08 



D-003332-05 



D-003332-06 
3-003332-07 
D-003332-08 



CATCATGTCTGGACTATTT 



CAAACAAGATTGAGATTAPi 



GCACACAGACAAATGAATT 
CGACAGTCTTGTACAATTT 

GCAAGCAGCCCATCAAfif^ 



GAGACGAGGACACGTACTA 



CIGCAGAACCTGACCATTG 



GGCCATGCCTGnTACAAG 



GCAAGGAGCTGAATTATCC 



TAAGAGATATCTGCAATGA 

6ACGGAAGGTGCTGAGAAA 

AGAGCAAAGTCTTCTATTT 



1349 



1350 



1351 



1352 



1353 



1354 
1355 



1356 



1357 



1358 



1359 
1360 

1361 



1362 



1363 



1364 



1365 



1366 
1367 
1368 



Gene 
Nam e 
AR 
AR 



Accession # 

NM_000044 
NM_000044 



Gl# 

21322251 
21322251 



Duplex # 

D-003400-01 
D-003400-02 



Sequence 

ggaactcgatcgtatcatt" 

CAAGGGAGGTTACAnnAAA 



SEQ. ID 
NO 

1369 
1370 



NM_.000044 



AR 



21322251 



NM 000044 



21322251 



D-003400-03 
D-003400-04 



IQAAGGAACTCGATCGTAT 



GAAATGATTGCACTATTrtA 



1371 



ESR1 



NM 000125 



ESR1 



4503602 



NM 000125 



ESR1 



NM_000125 



4503602 



D-003401-01 



D-003401-02 



GAATGTGCCTGGCTAfiAPiA 



4503602 



D-003401-03 



CATGAGAGCTGCCAACniT 



1373 



AGAGAAAGATTGGCCAGTA 



1374 



1375 



toRI 


NM U0U12O 


4o0o602 


n f\f\^ Af\A f\A 

D-003401-04 


CAAGGAGACTCGCTACTGT 


1376 














CODO 

toK2 


\m UU14o/ 


1 0oo501 2 


n\ Ar\o>ioo f\A 

U-00o402-01 


GAACATCTGCTCAACATGA 


1377 


boK2 


MM 0U14o7 


1 083501 2 


Pi AAO>IAO AO 

U-003402-02 


GCACGGCTCCATATACATA 


1378 




Klft>? AA^l >107 


1 0oo501 2 


AA0>ir*O AO 

U-003402~03 


CAAGAAGATTCCCGGCTTT 


1379 




nM Uu14o/ 


IO0OOOI2 


□-003402-04 


i^/^ AAA TT/^/^/— »T* A A A A A -1- 

GGAAATGCGTAGAAGGAAT 


1380 














rr ODD A 


NM 004451 


1 886091 9 


D-003403-01 


GGCCTTCGCTGAGGACTTA 


1381 


toRRA 


Kl^il f\f\A ACA 

NIvl 004451 


1 886091 9 


D-003403-02 


TGAATGCACTGGTGTCTCA 


1382 


CIODD A 

coRrv\ 


NM 004451 


18860919 


D-003403-03 


GCATTGAGCCTCTCTACAT 


1383 


hoRRA 


IMM 004451 


18860919 


D-003403-04 


^^^^ A A A AAA .^^•^^^ 

CCAGACAGCGGGCAAAGTG 


1384 














IZODDD 

coRRd 


NM 004452 


22035686 


L)-003404-01 


T" A /^^^T"^^ A mtwmm - AAA 111 

TAGCTGAGCTTACAAATTT 


1385 


ESRRB 


NM 004452 


22035686 


D-003404-02 


GCACTTCTATAGCGTCAAA 


1386 


hoRRb 


NM 004452 


00/10 C£?0^ 

22035686 


L)-003404-03 


CAACTCCGATTCCATGTAC 


1387 


tbRRb 


NM_004452 


22035686 


D-003404-04 


GGACTCGCGACCCATGTTT 


1388 














EoRRG 


KUVA /\f\A A^O 

NM 001438 


4503604 


D-003405-01 


AAA AAA A A A A ■ 1 ' "ir 

AAACAAAGATCGACACATT 


1389 


ESRRb 


KIIV/I f\f\A A^O 

NM_0014oo 


4503604 


D-003405'02 


TCAGGAAACTGTATGATGA 


1390 


ESKRG 


KIRA >IOO 

NM 001 4oo 


A C/\0 Gf\ A 

4503604 


0-003405-03 


GAAGACCAGTCCAAATTAG 


1391 


EbRRb 


Klftil /\f\A A*^Q 

NM 001438 


A CnoO/\ A 

4503604 


0-003405-04 


ATGAAGCGCTGCAGGATTA 


1392 














HNr4A 


NM 000457 


21361184 


D-003406-01 


CGACATGACTGGAGCATAT 


1393 


I_1KIIT>1 A 


NM 000457 


21361184 


D-003406-02 


GAAGGAAGCCGTCCAGAAT 


1394 


liNr-4A 


NM 000457 


21361184 


D-003406-03 


GGAAGTACATCCCAGCTTT 


1395 


nNh4A 


NM 000457 


21361184 


0-003406-04 


GGACATGGCCGACTACAGT 


1396 














HNF4G 


NM 004133 


6631087 


D-003407-01 


A A A AAA ^^^r^" A A 

GCACTGACATAAACGTTAA 


1397 


HNF4G 


NM 004133 


6631087 


t^f\t\ Ar\.'^ 

D-003407-02 


ACAAAGAGATCCATGATGT 


1398 


HNF4G 


NM 004133 


6631087 


D-003407-03 


AGAGATCCATGATGTATAA 


1399 


LIKIE" Ai^ 

HNF4G 


KIR /) f\r\ A A 

NM 004133 


6631087 


D-003407-04 


AAA A A A A A A A 

AAATGAACGTGACAGAATA 


1400 














1 lO A lO >l OCT 

HS A J 2425 


NM 017532- 


8923776 


D-003408-01 


GAATGAATCTACACCTTTG 


1401 


HSAJ2425 


NM 017532 


8923776 


D-003408-02 


GGAAATACGTG6AGACACT 


1402 


HSAJ2425 


NM 017532 


8923776 


0-003408-03 


CCAGATAACTACGGCGATA 


1403 


nSAJ2425 


NM 017532 


8923776 


O-003408-04 


TGGCGTACCTTCTCATTGA 


1404 














NR0B1 


NM 000475 


5016089 


D-003409-01 


CAGCATGGATGATAT.GATG 


1405 


KIDrkD A 


NM 000475 


a\A ^f\^i\ 

5016089 


0-003409-02 


CTGCTGAGATTCATCAATG 


1406 


i^mrvD A 

NROBI 


NM 000475 


5016089 


D-003409-03 


A A A ^F^^^^ A A A X^*^*^ A A 

ACAGATTCATCGAACTTAA 


1407 


NR0B1 


NM 000475 


5016089 


O-003409-04 


GAACGTGGCGCTCCTGTAC 


1408 














NR0B2 


NM 021969 


13259502 


D-003410-01 


GAATATGCCTGGCTGAAAG 


1409 


NR0B2 


NM 021969 


13259502 


D-003410-02 


A A A ^^^^ ■ AAA 

GGAATATGCCTGCCTGAAA 


1410 


NR0B2 


NM 021969 


13259502 


O-OO 3410-03 


CGTAGCCGCTGCCTATGTA 


1411 


NR0d2 


NM 021969 


1 3259502 


n f\f\*^AAf\ f\A 

O-003410-04 


GCCATTCTCTACGCACTTG 


1412 














NR1D1 


NM 021724 


1 3430847 


D-003411-01 


CAACACAGGTGGCGTCATCTT 


1413 


NR1D1 


NM 021724 


1 3430847 


r\ AAA 

O-003411-02 


GGCATGGTGTTACTGTGTATT 


1414 


NK1D1 


NM 021724 


13430847 


O-003411-03 


CAACATGCATTCCGAGAAGTT 


1415 


NR iD 1 


NM 021724 


1 3430847 


A ^ A A A 

D-003411-04 


GCGCTTTGCTTCGTTGTTCTT 


1416 














NR1n2 


NM 007121 


11321629 


D-003412-01 


GAACAGATCCGGAAGAAGA 


1417 


NR1n2 


NM 007121 


11321629 


0-003412-02 


GAAGAACAGATCCGGAAGA 


1418 


IN fx 1 \ 


MM 0(171 9i 


iiOOHOOA 


U-UUO'H z-uo 


ox A Aor^ A Ar^Tr^^r*Tf^r^'\ i r 
0 1 AAloLf AAVj 1 oL/U 1 ovj 1 1 1 


4 >f 'I A 

i4jy 


NR1H2 


NM 007121 


11321629 


D-003412-04 


GCTAACAGCGGCTCAAGAA 


1420 



NR1H3 
NR1H3 
NR1H3 
NR1H3 

NR1H4 
NR1H4 
NR1H4 
NR1H4 

NR1I2 
NR1I2 
NR1I2 

NR1 17 


NM 005693 
NM 005693 
NM 005693 
NM 005693 

NM 005123 
NM 005123 
NM 005123 
NM 005123 

NM 003889 
NM OO'^ARQ 
NM 003889 


5031892 
5031892 
5031892 
5031892 

4826979 
4826979 
4826979 
4826979 

11863133 

i 1 ODo 1 oo 

11863133 


D-003413-01 
D-003413-02 
D-003413-03 
D-003413-04 

D-003414-01 
D-003414-02 

D-003414-03 
D-003414-04 

D-003415-01 
l->-U03415-02 
D-003415-03 


GAACAGATCCGCCTGAAGA 
GGAGATAGTTGACTTTGCT 
GAGI 1 IGCCTTGCTCATTG 
TGACTTTGCTAAACAGCTA 

CAAGTGACCTCGACAACAA 
GAAAGAATTGGAAATAGTG 
CAACAGACTCTTCTACATT 
GAACGATACTCGCAATACA 

GAACCATGCTGACTTTGTA 
GATGGACGCTCAGATGAAA 
CAACGTACATGTTCAAAGG 


1421 
1422 
1423 
1424 

1425 
1426 
1427 
1428 

1429 
1430 
1431 


vit\ 1 1^ 


INivl__003o89 


11863133 


D-003415-04 


CAG6A6CAATTCGCCATTA 


1432 


NR1I3 
NR1I3 
NR1I3 
NR1I3 

NR2C1 
NR2C1 
NR2C1 
NR2C1 

NR2C2 
NR2C2 
NR2C2 
NR2C2 


NM 005122 
NM 005122 
NM 005122 
NM 005122 

NM 003297 
NM 003297 
NM 003297 
NM 003297 

NM 003298 
NM 003298 
NM 003298 
NM 003298 


4826660 
4826660 
4826660 
4826660 

4507672 
4507672 
4507672 
4507672 

4507674 

4507674 
4507674 


D-003416-01 
D-003416-02 
D-003416-03 
D-003416-04 

D-003417-01 
D-003417-02 
D-003417-03 
3-003417^04 

D-003418-01 

D-003418-03 


GGAAATCTGTCACATCGTA 
TCGCAGACATCAACACTTT 
CCTCTTCGCTACACAATTG 
GAACAGTTI GTGCAGTTTA 

TGACAGCACTTGATCATAA 
GGAAGGAAGTGTACACCTA 
GAGCACATCTTCAAACTAC 
GAAGAAATTGCACATCAAA 

GAACAACGGTGACACTTCA 
0 1 CpATGAGCTCCAACATAA 
CAACCTAAGTGAATCTTTG 


1433 
1434 
1435 
1.436 

1437 
1438 
1439 
1440 

1441 
1442 
1443 


NR2E1 
NR2E1 
NR2E1 
NR2E1 

NR2E3 
NR2E3 
NR2E3 
NR2E3 

NR2F1 

NR2F1 


NM 003269 
NM 003269 
NM 003269 
NM 003269 

NM 014249 
NM 014249- 
NM 014249 
NM 014249 

NM 005654 
NM U05d54 
NM 005664 ' 


4507674 

21361108 
2136-12108 
21361108 
21361108 

7657394 
7657394 
7657394 
7657394 

20127484 
20127484 
20127484 " 


D-003418-04 

D-003419-01 
D-003419-02 
D-003419-03 
D-003419-04 

D-003420-01 
D-003420-02 
D-003420-03 
D-003420-04 

D-003421-01 
3-003421-02 ' 
D-003421-03 ' 


gaagacacctaccgattgg 

gatcatatctgaaatagag 
caagactgctttcagatat 
gttagatgctactgaattt 
caatgtatctctatgaagt 

gagaagctcctttgtgata 
gaagcactatggcatctax 
gaaggatcctgagcacgta 
gaagctcgtttgtgatatg 

gaaactctcatgcgcgata 
tctcatccgcgatatgtta 
caagaagtgcgtcaaagtg 


1444 

1445 
1446 
1447 
1448 

1449 
1450 
1451 
1452 

1453 
1454 
1455 


NR2F1 

NR2F2 
NR2F2 
NR2F2 
NR2F2 

NR2F6 
NR2F6 
NR2F6 
R2F6 


NM 005654 

NM 021005 
NM 021005 
NM 021005 
KIM 021005 

MM 005234 , 
MM_ 005234 , 
VJM 005234 ; 
MM 005234 : 


20127484 

14149745 
14149745 
14149745 
14149745 

20070198 
20070198 
20070198 
20070198 


D-003421-04 

D-003422-01 
D-003422-02 
D-003422-03 . 
D-003422-04 

D-003423-01 ( 
D-003423-02 ( 
D-003423-03 ( 
3-003423-04 ( 


GGAACTTAACTTACACATG 

GTAGCTGTGCGGATATATT 
CCAACCAGGCGACGAGATT 
ftCTCGTACCTGTCCGGATA 
GGCCGTATATGGCAATTCA 

PGACGCCTGTGGCCTGTCA 
3AGGCGGTGTCCGAAGTGA 
3AAGCGTGACTGCCAGATC 

— ■ ■ 


1456 

1457 
1458 
1459 
1460 

1461 
1462 
1463 
1464 1 







J 27 


NR3C1 

INrxOv-/ i 


NM 000176 

InIVI UUUI f o 


4504132 
4504132 


D-003424-01 
D-003424-02 


GAGGACAGATGTACCACTA 

ATA A A ATr^ A r*x a tt/^ 


1465 
1466 


NR3C1 
iNr\ov> 1 


NM 000176 
IMM 000176 


4504132 
4504132 


D-003424-03 
D-003424-04 




1467 
1468 
















Nivi uooyoi 


4505198 


D-003425-01 


GCAAAPAGATni ATrr^A AOT 


1469 




NIvi 000901 


4505198 


D-003425-02 


GAGGTAAGATTTATr* Ar» A A 


1470 




IMM 000901 


4505198 


D-003425-03 


GGAPGAAAriTr^A A AOA A/-^T 
vjoMv^vDAAAo 1 OAAAoAAGT 


1471 




IMM 000901 


4505198 


D-003425-04 


GGT ATPPrs (^Tr*Tx A A A -r A 
vjo 1 M 1 ov^v3^ 1 0 1 1 AoAATA 


1472 














MRilAi 


NM 002135 


21361341 


D-003426-01 


GAAGGA AriTTr^xr^r*r* a a 0 a 


1473 


WR4A1 
IMrx'f/A 1 


iMM 002135 


21361341 


D-003426-02 


n AGG Af5 Ant 1 1 1 A A r^/^-r-r 


1474 


KIR/! Ai 


iMM 002135 


21361341 


D-003426-03 


n AGTGG PTPxrzi A r^x a /-»-r a -r 
1 000 1 0 1 vjAU 1 ALr 1 AT 


1475 




NM 002135 


21361341 


D-003426-04 


GAAGGPPr^PTr^Tnr^xr^T/-»-r 


1476 
















NM OOolob 


5453821 


D-003427-01 


GO A ATG PGTTr*/^xr* r» r>TT-r 


1477 




NM OObloD 


5453821 


D-003427-02 


oovjo 1 aoaqaovjAoAG 1 1 1 


1478 


MR/1 A 9 


NM 00o1c>D 


5453821 


D*003427-03 


CnAPGTriAPTTTr^A Ar*A at 
oo/nv^o 1 oAL# 1 1 1 OAAOAA r 


1479 


NIRAA9 


NM OObloO 


5453821 


D-003427-04 


GAATAP Af^ r*Tr*r^r^ a i i i r^T 


1480 














MRAA'^ 


NM 00b9o1 


11276070 


D-003428-01 


r* A A A G A A ATr* a A A T-r A 
oaaaijaavjA 1 UAGACATTA 


1481 


MRAA'^ 
IN r\Hr\u 


NM 00b9o1 


11276070 


D-003428-0? 


G ATr* A A P A TT A OXT A T—r/^ 

OA 1 \-^avjAOA 1 1 ALf 1 f ATTG 


1482 




NM 006981 


11276070 


D-003428-03 


PP AG Ar^ ATPTxn" A XX A -TT/^ 
wOAOAoA lull 1 I A 1 r C 


1483 


KIR^AQ 


NM 006981 


11276070 


D-003428-04 


GAAr^XX/^X/^r*/^TAr> A/^ at a 
OAAVj 1 1 1 AOAGATA 


1484 














MRRA-t 
IMrxOM 1 


NM 004959 


20070192 


D-003429-01 


GAT 1 1 A A /^XX^/^Tr^ A A -r A 
VJA 1 1 1 oAAo 1 1 1 oAATA 


1485 


KIpCA'l 

fMr\DAl 


NM 004959 


20070192 


D-00342Q-n? 


GGA<^P/^A0 (^Tr^ r^~rr^ r^T-r^-r-r 

ooaoovjA^jU 1 GO 1 GGTGTT 


1486 


I>Jr\Or\ 1 


NM 004959 


20070192 


D-00342Q-n'^ 


ooAovj I GGUUGACCAGATG 


1487 


vipc Ai 


NM 004959 


20070192 


D-003429-04 


PA APf^X/^n*/^X/^ A r^r^~rr^ at/-v 

\^AAoG 1 GUU 1 GAGCTCATC 


1488 














MR«;A9 


NM 003622 


20070161 


D-0034'^0-01 


PP A A A P A X AX/^ r^/^ A /^"T— r-r 

OL/AAAOA 1 A 1 GGCCACTTT 


1489 




NM 003822 


20070161 


D-0034?0^0? 


TPAni Ar^ A Ar^TT A A r^r^'i 1 r> a 

1 t^AOAl^AAO 1 1 AAGGTTGA 


1490 




NM 003822 


20070164=; 


13-003430-03 


GG ATPP AXPXxr^r^T/^/^T—r a 
ovja I v^oA 1 0 1 1 IGGTTA 


1491 




NM 003822 


20070161 


D-003430-04 


AAGAATAPPXr*TA/^*T Ar* a a 

AAoAA 1 AV-pO 1 U 1 AO r ACAA 


1492 
















NM 033334 


15451847 


D-0034'^1 -01 


PA APni A Ar^/^Tr^Tr^Tr^ ai-i-t- 

UAAOGAAOO 1 G rCTCATTT 


1493 


Mpc Ai 


K.livii noooo>« 

NM 033334 


15451847 


D-003431-.02 


GAAr^A APXAr* A/^ A a "i tt a 
V3AAV3AAO 1 AOAGAGA MIA 


1494 


vipo Ai 


NM 033334 


15451847 


D-0034'^1-n'^ 


G A Al^ AX/^n* AT A /^/^/^Tr^Ti^ A 

oaaijA 1 GGA 1 AGGCTGJGA^ 


1495 


MRRAi 


NM 033334 


15451847 


D-003431-04 


aaaovjA 1 AO 1 GG 1 AOA III* 


1496 














ni ill 

null 


U1bo15 


2116671 


D-003432-01 


GAAr^A AXr^ ATr^r^ a A~r a^^ at 
oAAijiAA 1 GA 1 OGAA 1 AGAT 


1497 


null 

nun 


Ulbolo 


2116671 


D-003432-09 


GA APAXO/^ A/^/^ a atata at 

OAAOA 1 GGAGOAATATAAT 


1498 


null 

nun 


U1b815 


2116671 


D-003432-03 


G AGG A rtir*XPTXr^/^r*r*TrT-r A 
OAvJvj AGCf 1 0 M GGOC 1 1 1 A 


1499 


null 

null 


U16815 


2116671 


D-003432-04 


TA A AP A An* AXO/^ A r^Ti^irr^ a 
1 AAAOAAOA i GOAO rCTGA 


1500 
















NM 000926 


4505766 


D-003433-01 


GAG AT/^ Aril^XO' A Af^r^Tur^ a 
oavja I VjAGG 1 OAAGOTACA 


1501 




NM 000926 


4505766 


D-003433-02 


P AGPGTTTPX AXn A A /^TT a 
v-rMoo\j 1 1 1 1 A 1 OAAO I lA 


1502 




NM 000926 


4505766 


D-003433-03 


AG ATA APXPxr* Axxr* a r^-r a 
MO A 1 AAo 1 0 1 OA I 1 OAGTA 


1503 


priR 


NM O0092b 


4505766 


D-003433>04 


GTAGTPAAI^XPOXOXA A at 
vj 1 MO 1 OAAVJ 1 GG 1 0 1 AAA r 


1504 














PPARA 


NM OOoOob 


7549810 


D-003434-01 


TPAPGGAniPXn^ Aor*o a att 
i wAOooAvjl>> 1 OAOGGAA 1 T 


1505 


PPARA 
ArxA 


IMM- OOoOoO 


7549810 


D-003434-02 


fnAAPAXn^An AX A/^ A AO^ ATT 

pAAOA 1 oAOA 1 AGAAG ATT 


1506 


PPARA 
t^n Ar\A 


NM OO0O06 


7549810 


□-003434-03 


fnGAXAniXXPXr^/^ A Ar^/-»T ■ r* 
ooA 1 AO 1 1 0 1 GGAAGOl 1 1 


1507 


PPARA 
r^r^ArxA 


NM 005036 


7549810 


D-003434-04 


GACTCAAGCTGGTGTATGA 


1508 














PPARD 


NM 006238 


5453939 


D-003435-01 


GAGCGCAGCTGCAAGATTC 


1509 



JZ9 



PPARD 
PPARD 
PPARD 


NM 006238 
NM 006238 
NM 006238 


5453939 
5453939 
5453939 


D-003435-02 
D-003435-03 
D-003435-04 


GCATGAAGCTGGAGTACGA 
GGAAGCAGTTG6TGAATGG 
GCTGCAAGATTCAGAAGAA 


1510 
1511 
1512 


PPARG 


NM 138712 


20336234 


D-003436-01 


AGACTCAGCTCTACAATAA 


1513 


PPAKb 


NM 138712 


20336234 


D-003436-02 


GATTGAAGCTTATCTATGA 


1514 


PPAKo 


NM 138712 


20336234 


D-003436-03 


AAGTAACTCTCCTCAAATA 


1515 


PPARG 


NM 138712 


20336234 


D-003436-04 


GCATTTCTACTCCACATTA 


1516 














D A D A 

KARA 


NM 000964 


4506418 


D-003437-01 


GACAAGAACTGCATCATCA 


1517 


DA DA 

KARA 


NM 000964 


4506418 


D-003437-02 


GCAAATACACTACGAACAA 


1518 


D A D A 

KARA 


NM 000964 


4506418 


D-003437-03 


GAACAACAGCTCAGAACAA 


1519 


O A D A 

KARA 


NM 000964 


4506418 


D-003437-04 


GAGCAGCAG1TCTGAAGA6 


1520 














RARB 


NM 000965 


14916493 


D-003438-01 


GCACACTGCTCAATCAATT 


1521 


DA DD 

KARB 


NM 000965 


14916493 


D-003438-02 


GCAGAAGTATTCAGAAGAA 


1522 


D A OD 

KARB 


NM 000965 


14916493 


D-003438-03 


GGAATGACAGGAACAAGAA 


1523 


D A nr> 

rwRB 


NM 000965 


14916493 


D-003438-04 


GCACAGTCCTAGCATCTCA 


1524 














rv\RG 


NM 000966 


21359851 


D-003439-01 


GAAATGACCGGAACAAGAA 


1525 


rxARG 


NM 000966 


21359851 


D-003439-02 


TAGAAGAGCTCATCACCAA 


1526 


RARG 


NM 000966 


21359851 


D-003439-03 


CAAGGAAGCTGTGCGAAAT 


1527 


D A O/^ 

RARG 


NM 000966 


21359851 


D-003439-04 


TCAGTGAGCTGGCTACCAA 


1528 














RORA 


NM 134261 


19743902 


D-003440-01 


GGAAAGAGTITATGTTCTA 


1529 


RORA 


NM 134261 


19743902 


D-003440-02 


CAA6ATCTGTG6AGACAAA 


1530 


RORA 


NM 134261 


19743902 


D-003440-03 


GCACCTGAGTGAAGATGAA 


1531 


RORA 


NM J 34261 


19743902 


D-003440-04 


CCGAGAAGATGGAATACTA 


1532 














RORB 


NM 006914 


19743906 


D-003441-01 


GCACAGAAGATCATTAAGT 


1533 


RORB 


NM 006914 


19743906 


D-003441-02 


CCACACCTATGAAGAAATT 


1534 


RORB 


NM 006914 


19743906 


D-003441-03' 


GATCAAATTCTACTTCTGA 


1535 


RORB 


NM 006914 


19743906 


D-003441-04 


TCAAACAGATAAAGCAAGA 


1536 














RORC 


NM 005060 


19743908 


D-003442-01 


TAGAACAGCTGCAGTACAA 


1537 


RORC 


NM 005060 


19743908 


D-003442-02 


TCACCGAGGCCATTCAGTA 


1538 


RORC 


NM 005060 


19743908 


DM)03442-03 


GAACAGCTGCAGTACAATC 


1539 


RORC 


NM 005060 


19743908 


D-003442-04 


CCTCATGCCACCTTGAATA - 


. .1540 












• 


DVD A 


NM 002957 


21536318 


D-003443-01 


TGACGGAGCTTGTGTCCAA 


1541 


RXRA 


NM 002957 


21536318 


3-003443-02 


OAAPA Ar^r^ Ar^Tr^i^^xr^ att 

V-»AAM^^MMV3V3AV-» 1 oLfO 1 VjA t i 


1542 


RXRA 
RXRA 


NM 002957 
NM 002957 


21536318 
21536318 


D-003443-03 
>003443-04 


GCAAGGACCTGACCTACAC 
C^ClAJli^c^Ar^r^f^r^ a A/^r^ a/^ a a 


1543 


r<XRB 


NM 021976 


21687229 


D-003444-01 


vjoa\mvjVj/\1^\>^vj o AAUvj AG AA 


1544 


D VOD 

RXRB 


NM 021976 


21687229 


D-003444-02 


GCAAAGACCTTACATACTC 
GGAATCATTCTGTTTAATC 


1545 
1546 


DVDD 


NM 021976 


21687229 


D-003444-03 


TCACACCGATCCATTGATG 


1547 


FOCRB 


NM 021976 


21687229 


D-003444-04 


GGAAACGGCTATGTGCAAT 


1548 














RXRG 


NM 006917 


21361386 


D-003445-01 


GGAAGGACCTCATCTACAC 


1549 


RXRG 


NM 006917 


21361386 


D-003445-02 


CCGGATCTCTGGTTAAACA 


1550 


RXRG 


NM 006917 


21361386 


D-003445-03 


gcgagccattgtactcttt 


1551 


■vXRG 


NM 006917 


21361386 


D-003445-04 


gagccattgtactctttaa 


1552 














THRA 


NM 003250 






ovjACAAAGACGAGCAGTGT 


1553 


THRA 


NM 003250 


20127451 


D-003446-02 


GGAAACAGAGGGGGAAATT 


1554 



THRA 
THRA 

THRB 
THRB 

1 nr\D 


NM 003250 
NM 003250 

NM 000461 
NM 000461 
NM 000461 
NM 000461 


20127451 
20127451 

10835122 
10835122 
10835122 
10835122 


D-003446-03 
0-003446-04 

D-003447-01 
D-003447-02 
D-003447-03 
D-003447-04 


GTAAGCTGATTGAGCAGAA 
GAACCTCCATCCCACCTAT 

GAATGTCGCTTTAAGAAAT 
GAACAGTCGTCGCCACATC 
GGACAAGCACCAATAGTCA 
GTGGAAAGGTTGACTTGGA 


1555 
1556 

1557 
1558 
1559 
1560 














VUrs. 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


\/r\D 
VUK 


KIR it f\r%n^n^^ 

NM 000376 


4507882 


D-003448-02 


GCAACCAAGACTACAAGTA 


1562 


VDR 


NM 000376 


4507882 


D-00'?44ft-n^ 


J OAA 1 tsO 1 A 1 GACGTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 



Table X 



Gene 

Symbol 

ABCB1 



ABCC1 



ABCG2 



KCNH2 



KGNH1 



CLCA1 



■■Mti. ' Senses'; ; 


:; SEQV:ID NO.:^ 


GACCAUAAAUGUAAGGUUU 


1565 


UAGAAGAUCUGAUGUCAAA 


i 1566 


GAAAUGUUCACUUCAGUUA ~ 1567 


GAAGAUCGCUACUGAAGCA ! 1568 ~ 








lGgaagcaacugcagagaca 


_ 1569 


GAUGACACCUCUCAACAAA 


1570 


UAAAGUUGCUCAUCAAGUU 


1571 


CAACGAGUCUGCCGAAGRA 


1572 




Sense • 




GCAGAUGCCUUCUUCGUUA 


1573 


AGGCAAAUCUUCGUUAUUA 


1574 


jGGGAAGAAAUCUGGUCUAA 1'^7S " 


rUGAGUCAUCCCAACAUUUA " 1575 ^ 




Sense 




CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGGGACACUU 


1578 


GAUCAUAGCACCUAAGAUA 


1579 


GCUAUUUACUGCUCUUAUU 


1580 


UCACUGGGCUCCUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUACUGUA 


1583 


UGACGGCGCUCUACUUCAC 


1584 




Sense . " ■ ■•■ ■ 




GAGAUGAAUUCCUUUGAAA 


1585 


GAAGAACGCAUGAAACGAA 1 1536 


IGAUAAAGACACGAUUGAAA I' 1'5ft7 • 


GCUGAGAGGUCUAUUUAAA l«?ftfl 




Sense 




GAACAACAAUGGCUAUGAA 


1589 


GUACAUACGUGGCUGGAUU 


1590 



SLC6A1 



SLC6A2 



SLC21A2 



SLC21A3 



1 GAACAGCUCACAAGUAUAU 


1591 


fGGAAACGUGUGUCUAUAUU 


1592 1 




Sense 




) GGAGGUGGGAGGACAGUUA 


1593 


i UCACAGCCCUGGUGGAUGA 


1594 


GAAGCUGGCUCCUAUGUUC 


: 1595 


GGUCAACAGUACCAACAUG 


1596 




. Sense 




GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGCUGGGCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


) CAACAAAUUUGACAACAAC 


1600 




'W;:-- - ■ Sen^e' . • • V-g : 




1 GUACAUCUCCAUCUUAUUU 


1601 


i GGAAGUGGCUGAGUUAUUA 


1602 


: GMGlGGAGGCUCAAUGUAA 


1603 


i GAAGGAAGUGGCUGAGUUA 


1604 




Sense .■■>?•■■. 




GUAGAAACAGGAGCUAUUA 


1605 


CAAGAUUAGUGUCAAACAA 


1606 


GCACAAGAGUAUUUGGUAA 


1607 


GCAAAUGUCCCUUCUGUAU 


1608 


GCAUGACUCCUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA \ 


1610 i 


GAAAAUGCCUCUUCAGGAA 


1611 



SLC28A1 



1 Sense 


. 1 i . 


GUUCAUCGCUCUCCUCUUU 


' 1612 


GGAUCAA6CUGUUUCUGAA . -r 


1613 


GGACUGCAGUUUGUACUUG 


i 1614 


• GAGUGAAACUGACCUAUGG 


! 1615 



SLC29A1 



■ Sense ■ | • 


GAACGCUGCUCCCGUGGAA 




1616 


GAAAGCCACUCUAUCAAAG 




1617 


GAAACCAGGUGCCUUCAGA 




1618 


CCUCACAGCUGUAUUGAUG 


1 


1619 



SLC26A1 f 



SLC26A2 



1 Sense ■ ' . 1 


i CCACGGAGCUGCUGGUCAU 


1620 


G6GUUGACAUCUUAUUUGA 


1621 


GCACGAGGGUGUCUGUGUU 


1622 


GGCCAUCGCGUACUCAUUG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA 


1627 




Sense 




CCAAAGAACUCAAUGAACA 


1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 



! GUAUUGAACUGUACUGUAA 



1631 



SLC4A4 



GLRA1 



KLK1 



ADAM2 



XPNPE 
PI 



GZMA 



CMKLR1 



CLN3 



CALCR 



■ Sense 




GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


, 1633 


GGACAAAGGCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 




Sense 




UGAAAGCCAUUGACAUUUG 


j 1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGACUA 


1639 




1 Sense . '•■ 




! UCAGAGUGCUGUCUUAUGU 


1640 


1 CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGCUGAUAC 


1642 i 


AGGCGGCUCUGUACCAUUU 


1643 1 




Sense . 




GAAACAUGCUGUGAUAUUG 


1644 


GCAGAUGUUUCCUUAUAUA 1 645 ' 


CAACAGAGAUGGCAUGAUA 1 1646 


GAAAGGGGCUACAUUGAGA i 1647 




'ISense s 




GACCUGAGCUUCCCAACAA 


1648 


IGCGACUGGCUCAACAAUUA 


1649 


GAGAUUGCGUGGCUAUUUA i 1650 


GACAGCAACUGGACACUUA 1 1651 ' 




•)■'■■• '^'^CsSense • -.-Mr- 




GGAAGAGACUCGUGCAAUG 


1652 


GGAAGCAUGUGCCAAGUUG 


1653 


GAAGUAACUCCUCAUUCAA 


1654 


GAACUCCUAUAGAUUUCUG "* 


1655 




■• . ■'..•:'-;.Sense -i;. 


'. f. •. 


CAUAGAAGCUUUACCAAGA 


1656 


rGAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGACUUCUAUGAAUG 


1659 




Sense 




CAUCAUGCCUUCUGAAUAA 


1660 


^CAACAGCUCAUCACGAUUU 


1661 


iGCAACAACUUCUCUUAUGU 


1662 


GGUCUUCGCUAGCAUCUCA 


1663 




Sense 




GGACCUAGCUGUUGUAAAG 


1664 


GAAAGACCAUGCAUUUAAA 


1665 


GCAGGAAGAUGUAUGCUUU | 


1666 i 


.GAAUAAACCAGUAUGGUUA 


1667 



OXTR 



ED64 



EDG5 



EDG7 



PTCH 



SMO 



CASP3 



CASP6 



CASP7 



Sense | 


' GGACCCAGAUAUCCAAAUA 


1 1668 


GCAAUACUAUCCUAACUGA 


1669 


GAAUAUAGAUUAGCGUUUG 


1 1670 


GAUGAGGCAU6ACUACUAA 


1671 




Sense 




GCGAGUCUGUCCACUAUAG 


1672 


; GAGAACGGCCACCCACUGA 


1673 


1 GAACGGCCACCCACUGAUG 


1674 


! GGUCAAUGCUGCUGUGUAC 


1675 




' '^Wi ..Sense;;? .• -9^ . 


UCCAGGAACACUAUAAUUA 


!_ 1676 


GUGACCAUCUUGUCCAUCA 


. 1677 


CAUCCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 




1 &I Sense • . ' 




1 ACACUGAUACUGUGGAUGA 


1680 1 


AAUAGGAGCAACACUGAUA 


1681 i 


: CAGCAGGAGUUACCUUGUU 


1682 ! 


i GGACACCCAUGAAGCUAAU 


1683 1 




fe: --i. Sense:. . /^i . ■ '. 




GCACAGAACUCCACUCAAA 


1684 


iGGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 i 


iSGACAAACUUCGACCCUUU 


1687 




"'U ■ . Sense'"' ■■■ '-fc -f^ 


-^-^ i 


UCGCUACCCUGCUGUUAUU 


1688 i 


GCUACAAGAACUACCGAUA 


1689 


CAAGAAAGCUUCCUUGAAC 


1690 


\ GAGAAGAAAUAGAGUGAAU 


1691 




Siense 




GAAUAUAUCUGAAGAGCUA 


1692 


GAAGUGGACUGUGGCAUUG 


1693 ; 


GUGAGAAGAUGGUAUAUUU 


1694 


GAGGGUACUUUAAGACAUA 


1695 




Sense 


CAUGAGGUGUCAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG | 


1697 1 


AAAUAUGGCUCCUCCUUAG 


1698 i 


GCAAUCACAUUUAUGCAUA ! 


1699 


CAAGAUAAGUGAGGUGGAU 1 


1700 


GAUGGUACAUUCAAGAUUU 1 


1701 




Sense 




GAACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


CAAGAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUGACUGA 


1705 



CASP8 



CASP9 



DVL1 



DVL2 



PTEN 



PDK1 



PDK2 



PPP2CA 



CTNNA1 



Sense 



GAAGUC5AAPI lAI mAAnr iaa 


1706 


OA AO A A/^O AI IOA/^A A/^a a a 


1707 


_wjMUMMMo U UUAUOAAA UG 


1708 


GAGGf^l IPHAMOAI lOllAI II lA 


4. 1709^ 


GAAUAUAGAr^f^riPl II Iai ioa 


J 1710 


CAAPf^APIIAiir^AAriA AI 11 lo 


■ 1711 


GAAGUGAGCAGAUCAGAAU 


i 1712 


1 GAGGAAAUCUCCAAAUGCA 


1 1713 




Sense 1 


CCAGGCAGCUGAUCAUAGA 


: 1714 


ruCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


_ 1716 


CCACUGGUCUGUAGGGAUU 


1717 




■ % .;Sense ' 




UCGUAAAGCUGUUGAUAUC 


^ 1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUCGA 


1720 


GAUCGUAAAGCUGUUGAUA 


1 1721 




1 ■ t Sense, -y. ^'r.l 


I AGACG AAGGUGAUUUACCA 


! 1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA 


i_ 1724 


[UAAUAGGCAUUUCCUGUUU 


1725 




.5.- ■>,! S; Sense. ■•■ '^^.^ 




GUGAAGAUCUUGACCAAUG 


1 1726 


GAUCAGCAUACACAAAUUA 


1727 


GAAUGAACCUUCUGCAACA 


1728 


1 GGCGCUAUGUGUAUUAl 11 lA 


i 1729 




■^ensev. ■ i ■ 




GUAGAAAGCUGGUAUAUCC 


1730 


GAAAGACUCCCAGUGUAUA 


1731 


GGAAGUCCAUCUCAUCGAA ] 


1732 


JCCAAAGACAUGACGACGUU 


1733 




Sehse-- 




GUAAAGAGGAGACUGAAUG 


1734 


iGGUCUGUGAUGGUCCCUAA 


1735 


iCAAAGAUGCGUACGACAUG ' 


1736 


LGGGCGAUGCCUGAGGGUUA i 


1737 




Sense 1 




UCACACAAGUUUAUGGUUU i 


1738 


CAACAGGGGUGACCACUUU j 


1739 


UAACCAAGCUGCAAUCAUG T 


1740 


GAACUUGACGAUACUCUAA ' 


1741 




1 Sense |. 


, GAAGAGAGGUCGUUCUAAG 


1742 


lAAGCAGAUGUGCAUGAUUA ! 


1743 


UCUAAUAACUGCARIIftlllln i 


1744 



GUAAAGGGCCCUCUAAUAA 



1745 



'H - ■ Sense 


■ 


GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCACAAUG 


1747 


lgcaggaagauuaugaugug 


1748 


1 aaagaaagcgcauguacua 


1749 




Sense 


GGGAAAGAGCUGCAUAUUA i 1750 _| 


GCUUAGAACUCUUUACUGA ! 1751 


UAUAAGAGCUUGACCAAUG i 1752 i 


GCAGAUAUCUCUAUGAUUG 1 1753 i 




Sense : 




CAACUCAUGUCGAAUAGUG 


1754 


GGAAUGAGGGAGAUGUUUA 


1755 


GGAGACAGGUGUACGUUGU 


1756 • 


nDcCAAGAGCUGACAACUGA 


1757 




W^M- Senseii/"^ 




GUAAGGAGAAGCAAUAUAA 


1758 I 


AAGAUGAGCUUUCCAUGUA 


1759 


GGAGAUCUAUCUCAUCAUU 


1760 


GACAAGAGCGCACAGAGUA 


1761 




•' Sensi • .iMlE? 




GUACUUCCCUGAGGCCUAU 


1762 i 


GCUGUGCGUUGACUACGUA 


1763 


GUACUUCCCUCAGGGCUAU 1764 


GGUCAGGUGGCUCGAGAUC \ 1765 




% ■: : ^Sense ; 1: ' i ' 




CAGCGUAACUUCAUUCUUC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA 


1769 


UAGAAGAGCUGGGUCAGAA 


1770 


CCAGAUAUGCGUUGAUUGA 


1771 


GCGCAGGGCUCUCUACCUA 


1772 




Sehs^ 




GAACAGCCUUGAAAUCAAA , 


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


CAGGAAAGGUCGUAAUUUA [ 1776 




Sense 




CCACACGGCUGACCUCAUA 


1777 


CAGCAGAAUUCCUCUUAUA 


1778 


GCAGAAUUCCUCUUAUAAA 


1779 


GAUCGGGAUUUAUUUCUAU 


1780 




• Sense 




UGUGGGAGCUGAUGACUUU 


1781 


UCACAGAGAUGUUGAAAGG 


1782 



SOS1 



BRCA1 



CDKN1A 



CDKN1B 



SLC2A4 



N0S2A 



FRAP1 



FKBP1A 



UGGAAGAGAUCACAGGUUA 


1783 


GCUCAUCGCUCACAACCAA 


1784 1 




Sense 




GAGCACCACUUCUAUGAUU 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


UGAAAGCCCUCCCUUAUUA 


1787 '\ 


GAAAUAGCAUGGAGAAGGA 


1788 




Sense 




CCAUACAGCUUCAUAAAUA 


1789 


GAAGAGAACUUAUCUAG UG 


I 1790 


GAAGUGGGCUCCAGUAUUA 


1791 


GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 




' ■■ Sense'^v'-v. ife ■: 




GAACAAGGAGUCAGACAUU 


1796 


AAACUAGGCGGUUGAAUGA 


1797 


GAUGGAACUUCGACUUUGU 


1798 


1 GUAAACAGAUGGCACUUUG 


1799 ! 




; . 'Sense '■■ 




GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU 


1801 


'gaauggacauccuguauaa 


1802 


, guaaacagcucgaauuaag j 


1803 




V; Sense 




' cagauaggcuccgaagaug 


1804 


agacucagcugcagaauac 


1805 


gaucgguucuuucaucuuc 


1806 


caggaucgguucuuucauc 


1807 




Sense •• ■ 1 




ccagauaagugacauaagu i 


1808 


: UAAGUGACCUGCUUUGUAA 


1809 


j gaagagagauuccauugaa 


1810 i 


lUGAAAGAGCUCAACAACAA 


1811 




Sense 




GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 


1813 


j CCAAAGUGCUGCAGUACUA 


1814 


jUAAGAAAGCUAUCCAGAUU 1 


1815 




Sense 


GAAACAAGCCCUUUAAGUU i 


1816 1 


GAAUUACUCUCCAAGUUGA ! 


1817 


CAGCACAAGUGGUAGGUUA i 


1818 


GUUGAGGACUGAAUUACUC i 


1819 


GAUGGCAGCUGUUUAAAUG i 


1820 


GAGUAUCCUUUCAGUGUUA 


1821 1 



^Sense • . •" ^ 




CAAAGGAACCUACUUGUAC 


1822 


GGAACCUACUUGUACAAUG 


1823 


GAACCUACUUGUACAAUGA 


1824 


1 GAGUGUGUCUCGUGUAGUA 


1825 




■ Sense 




GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 1827 


GGACUUGUGUGCCCUUAUA 1828 


GAACACAAAGGCACUAUAA 1829 




i^Sense f- v 




CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCCCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 




1 -V' -i '% Sense • ';■ i ' 




! GAAGACAGAGUUAUUAAAC 


1834 


; UCACGAAGAGAGAGUUAUU 


1835 


AGACAGAGUUAUUAAACCA 


1836 , 


CACGAAGACAGAGUUAUUA 


1837' i 


GCUGAAGCCUGUCUGAUGU 


1838 i 




/ISense % •• 




CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 I 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


fCAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 




Sense t: "y- ■ ' .. - 


UGAAGCACCUUGAUCUUUG 


1847 ' 


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


■GAGGAGCGCUGUUUGAGUU 


1851 


' GGAUGUCUCUCUCCUCUUU 


1852 


: GCUCACUCCUUUCUACUAA 


1853 


UGAAGCACCUUGAUCUUUG 


1854 




Sense;. 




GCACAGAUAUUUCCAUUUC 


1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 



Sense 


UAAGAACACUGCUCAUGAA 


1859 


GAGGCAUCCUGAAGCAUUA 


1 1860 


GAAGGCGGCUGCAGAACUG 


! 1861 



HI 



I GGAA CAAGGAGAUCAUGUA " 



1862 



IKBKG 



RELA 



ARHA 



CDC42 



R0CK1 



PAK1 



PAK2 



TNFRSF 
5 



Sense | 


CUAUCGAGGUCGUUAAAUU 


1863 


GAAUGCAGCUGGAAGAUCU 


1 1864 


GCGGCGAGCUGGACUGUUU 


L 1865 


CCAGACCGAUGUGUAUUUA 


1866 ^ 




Sense 




[GGUCUCACCUCGCUAUGGU 


1867 J 


GAAAGCGAAUUCCUAGACA 


1868 


, GCACAAACAAGACUGAUGU 


1869 


GAAGGGCACCUCAGAAACA 


1870 ~ 1 


UCUCCCAACUUGUAUUAAA 


1 1871 ! 




' " rv? • Sense : 




UCAAGUGUCUUCCAUCAUG 


1872 ! 


[yCAAGUGCCUUAAUAGUAG 


1873 i 


GGAGUACCCUGAGGCUAUA 


1874 


GAUGAGAUCUUCCUACUGU 


1875 




: . 'Sense : 




GAGCUGGGCUAAGUAAAUA 


' 1876 


GACCAAAGAUGGAGUGAGA 


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




■■■■••ii 4-!:. Sense •• 




GGACAUUUGUUUGCCAUUU 


1880 


GGAGAAGCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUA6AG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 




1 • Sense ^v. 




GGAAUGAGCUUCAGAUGCA 


1887 


i GGACACAGCUGUAAGAUUG 


1888 


GACAAGAGAUUACAGAUAA 


1889 


GAAGAAACAUUCCCUAUUC 


1890 




Sense 1 




GAGGGUGGUUUAUGAUUAA 


1891 


CAACAAAGAACAAUCACUA 


1892 


GAAGAAAUAUACACGGUUU ~i 


' 1893 


UACAUGAGCUUUACAGAUA i 


1894 




Sense 




GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 1 


CUACAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACCGUUA 


1898 1 



PAK3 



Sense 



fGAliUAUCGCUGCAAAGGAA 
fGAGAGUGCCUGCAAGCUUIJ 



j GAC AAGAGGUGGCCAUAAA 



UUAAAUCGCUGUCUUGAGA 



1899 



1900 



1902" 



PAK4 



PAK6 



PAK7 



HDAC1 



HDAC2 



CREBBP 



BTRC 



RIPK2 



VAV1 



Sense 1 . -f 


ACUAAGAGGUGAACAUGUA 1903 


GAUCAUGAAUGUCCGAAGA 1904 
GAUGAGACCCUACUACUGA 1905 ' 


CAGCAAAGGUGCCAAAGAU j 1906 




Sense ,;> 




UAAAGGCAGUU6UCCACUA 


1907 


GAAGGGACCUGCUUUCUUG 


1908 


( GCAAAGACGUCCCUAAGAG 


1909 


CCAAUGGGCUGGCUGCAAA 


1910 




M-.u'-- iSerise - 




GAGCAGGGCUUUAAUAAGU 


1911 


CAAACUCCGUUAUGAUAUA 


1912 


IGGAUAAAGUUGUCUGAUUU 


1913 


GGAAAUGCCUCCAUAAAUA 


1914 




■'^ ' , Sense •A-;;^-, ' U'- ■ I 


GGACAUCGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


GGACAAGGCCACCCAAUGA 


1917 


! CCACAGCGAUGACUACAUU 


1918 




Sense ■}$:. 




GCUGUUAAAUUAUGGCUUA 


1919 


' GCAAAGAAAGCUAGAAUUG 


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


CCAAUGAGUUGCCAUAUAA 


1922 




Sense %y ■ 




GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGCCGUUUACCAUGA 


1924 


GGACAGCCCUUUAGUCAAG 


1925 


GAACUGAUUCGUGAAAUAA 


1926 




Sense 




CACAUAAACUCGUAUCUUA 


1927 


IGAGAAGGCACUCAAGUUUA 


1928 


AGACAUAGUUUACAGAGAA 


1929 


GCAGAGAGAUUUCAUAACU 


1930 




Sense 




GAACAUACCUGUAAAUCAU 


1931 


rGGACAUGGAGCUGUUAUUA 


1932 


i UAAAUGAACUCCUACAUAG 


1933 


i GGAAUUAUCUCUGAACAUA 


1.934 




Sense 




GCAGAAAUACAUCUAGUAA 


1935 


GCUAUGAGGUGUUCUUCAA 


1936 



VAV2 



BRAF 



CGACAAAGCUCUACUCAUC 


1 1937 


GCUCAACCCUGGAGACAUU 


i 1938 




Sense . | . . 


GGACAAGACUCGCAGAUUU 


i 1939 J 


GCUGAGCGCUUUGCAAUAA 


i 1940 j 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 




Sense 




UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAGUGCAUGACGUUUA 


, 1943 


UGAAUGAGCUGGUGGAUUA 


1944 


UGCCAAAACUUACCUAUAA 


1945 








GAGCUGCACUCCAAUGAGA 


1 1946 


GAAACCAAGCCAUUAAUGA 


1947 


CCAAGGAGCUACUGACAUU 


i 1948 


AGAGAAACAUGGCCCAAUA _ 


1 1949 




■ •: Sense ,■ " • " • - fai.-. «s : 


GCACAGACAUUUACAUUAA 


1950 


. GAAGGGAGUUUGCUAAAUU 


1951 


GAACAGAUCUGAUGAAUGA 


1 . 1952 


CAAGAGAGCUGAAGACUAU 


1953 




■ -i-Sen^i^, '/sr: • ■ • | t^. - 


GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GMCUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUIJ 


1957 




Sense ^■::;v . | . ■•• w 


AGAAAGAGCUUGACAGUAA 


1958 


GGAAGUAGUUCACAAAAUA 


1959 1 


UGAAGUAUCUGUAUCCAAA 


1960 


GAGCUUCACUCCCUUAGUU 


1961 




•Sense V 




; UAAGGACUCUGAAGAUGUA 


1962 


GACAAAGUGUGUAAUUAUG ^ 


1963 


GCUCAGGACU UAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAUACCUAAGAU 


1966 


UAAGG ACUCUGAAGAUG UA 


1967 


UAGAAAGUGUGUAAUUAUG 


1968 


GGUCAGGACUUAGCAAGAA 


1969 




Sense 




GCAUCCAGCUGAUCCAGAA 


1970 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 




Sense 


GAyUAGAGACCAAGGAUUU | 



14U 



ELK1 



RALGDS 



PRKCA 



MAP2K4 



MAP2K7 



MAPK8 



MAPK9 



AIF1 



CCACUGAUGUGUGUUAAUU 


1973 j 


CAAUAGAACCUGUCAAUAU 


1974 1 


GAAGAGAGGAAUCGAAUGA 


1975 i 




Serise •;■ - 




GAUGUGAGUAGAAGAG UUA 


1975 


GGAAGAAUUU6UACCAUUU 


1976 


GAACGACCUUUCUUUCUUU ! 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 




V' Sense 




GGAGAAGCCUCACCUCUUG 


1979 ! 


GCAGAAAGGACUCAAGAUU i 


1980 


GAGAACAAGUACUCAUUGA 


1981 


GAACUUCUGGUCACUGUAU 


1982 




V •. ■ ■ cSense ■ 1:'^ ' 




GGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUGUCGUAUGUCA 


1984 


GAAGAAGGAUGUGGUGAUU 


1985 


GGACUGGGAUCGAACAACA i 1986 


i 


%y Sehle. . 


• .. 


■ GGACAGAAGUGGAAAUAUU 


1987 


UCAAAGAGGUGAACAU UAA 


1988 


GAGCAAAUCUCAGUUGUUU 


1989 


GGAGAAUGGUGCUGUUUAA 


1990 




■ . Sens^.:: .... a ; 




GAAGAGACGAAAGUAUAAU 


1991 


IGAAGACCGGCCACGUCAUU 


1992 


GGAAGAGACCAAAGUAUAA 


1993 1 


GCAUUGAGAUUGACCAGAA 


1994 ! 


UGAGAGAACGAGAAAGUUG 


1995 


GUGAAACCCUGUCUGCAUU 


1996 


GGAUCUCUCUCAACAACUA 


! 1997 


ACAACUAGGUGAACACAUA 


1998 




Seoise".' 




UCACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC 


2000 


GCUCAUGGAUGCAAAUCUU 


2001 


GAAGCUAAGCCGACCAUUU 


2002 




• Sense 




AAAGAGAGCUUAUCGUGAA 


2003 


GAUGAUAGGUUAGAAAUAG 


2004 


ACAAAGAAGUCAUGGAUUG 


2005 


GGAGCUGGAUCAUGAAAGA 


2006 




Sense 




GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUCCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA 


2009 


UGAAACGAAUGCUGGAGAA 


2010 


UCACUCACCCAGAGAAAUA 


2011 





1 CCAAGAAAGCUAUCUCUGA 


! 2012 




i AGACUCACCUAGAGCUAAA 


1 2013 






BBC3 


Sense 1 




CCUGGAGGGUCCUGUACAA 


2014 




GAGCAAAUGAGCCAAACGU 


2015 i 




GGAGGGUCCUGUACAAUCU 


2016 




GACUUUCUCUGCACCAUGU 


i 2017 






BCL2L1 


Sense 


• ;-| 




CCAGGGAGCUUGAAAGUUU 


2018 ! 




AAAGUGCAGUUCAGUAAUA 


: 2019 




GAGAAUCACUAACCAGAGA 


i 2020 




GAGCCCAUCCCUAUUAUAA 


1 2021 






BCL2L11 


■ ■ ■ > Sense ; i r-' - 


i 




GAGACGAGUUUAACGCUUA 


2022 




AAAGCAACCUUCUGAUGUA 


2023 




CCGAGAAGGUAGACAAUUG 


2024 




GCAAAGCAACCUUCUGAU6 


2025 




AGACAGAGCCACAAGGUAA 


2026 




GCAAGGAGGUUAGAGAAAU 


2027 




CAAGGAGGUUAGAGAAAUA 


2028 


1 UCUUACGACUGUUACGUUA 


2029 1 


BID 




^ Sense' k: ^' 






GAAGACAUCAUCCGGAAUA 


2030 




CAACAGCGUUCCUAGAGAA 


2031 1 




GAAAUGGGAUGGACUGAAC 


2032 




ACGAUGAGCUGCAGACUGA 


2033 . i 






BIRC2 


. " Sense '-' 'Uk-. ■ 






iGAAAGAAGCCUGCAUAUAA 


2034 




GAAAUUGACUCUACAUUGU 


2035 




ACAAAUAGCACUUAGGUUA 2036 




GAAUACACCUGUGGUUAAA 


2037 






BIRC3 


■ ■ ■ ^ Sense .iS;*- 






GGAGAUGGCUGCCAUUAAA 


2038 1 




UCAAUGAUCUUGUGUUAGA 


2039 




GAAAGAACAUGUAAAGUGU 


2040 1 




GAAGAAAGAACAUGUAAAG ! 


2041 1 






BIRC4 


Sense • .;• 






GUAGAUAGAUGGCAAUAUG 


2042 




GAGGAGGGCUAACUGAUUG 


2043 




GAGGAACCCUGCCAUGUAU 


2044 




GCACGGAUCUUUACUUUUG 


2045 






BIRC5 


Sense 






GGCGUAAGAUGAUGGAUUU 


2046 




GCAAAGGAAACCAACAAUA 1 


2047 




GCACAAAGCCAUUCUAAGU 


2048 




CAAAGGAAACCAAGAAUAA 


2049 






BRCA1 1 


Sense 


1 



CARD4 



CASP10 



CASP2 



CDKN1A 



CFLAR 



CLK2 



CLSPN 



CSNK2A 
1 



CCAU ACAGCU UC AUAAAUA 


2050 


GAAGAGAACUUAUCUAGUG 


2051 


GAAGUGGGCUCCAGUAUUA 


2052 ^ 


GCAAGAUGCUGAUUCAUUA 


2053 


CCAUACAGCUUCAUAAAUA 


2054 1 




Sense • 


• • 1 


GAAAGUUAAUGUCAAGGAA 


2055 j 


GAGCAACACUGGCAUAACA 


2056 


UAACAGAGAUUUGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


i 2058 




Sense v 




^AAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


2060 


GAAGGCAGCUGGUAUAUUC 


2061 


GACAU6AUCUUCCU UCUG A 


2062 1 


GCACUCUUCUGUUCCCUUA 


2063 1 




. '.: ' "Sense -i 


'Si-' 


GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCCUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 




Sense-r«- "' 




GAACAAGGAGUCAGACAUU 


2068 


AAACUAGGCGGUUGAAUGA 


2069 


GAUGGAACUUCGACUUUGU 


2070 


lGUAAACAGAUGGCACUUUG 


2071 




%.}: Sense • if;',- 




GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA 


2073 


GAGCAUACCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 ! 




■ Sense . . ,. :vi - • 


gugaauaugugaaauagug 


2076 


AAAGCAUGCUAGAGUAUGA | 


2077 


.uuaagaauguggagaagua i 


2078 


IGAUAACAAGCUGACACAUA i 


2079 




1 Sense 




! GGACGUAAUUGAUGAAGUA 


2080 


i GCAGAUGGGUUCUUAAAUG 


2081 


[CAAAUGAGGUUGAGGAAAU 


2082 


IGGAAAUACCUGGAGGAUGA 


2083 




Sense 




GAUCCACGUUUCAAUGAUA 


2084 : 


GCAUUUAGGUGGAGACUUC 


2085 


GAUGUACGAUUAUAGUUUG 


2086 


UGAAUUAGAUCCACGUUUC I 2087 " 



CTNNB1 



CXCR4 



CXCR6 



DAXX 



GAS41 



GTSE1 



Sense 




CuOAGAAGAAUGGAUCACAA 


2088 


^^/^l 1/*^ AAA A 1 l/^i^ A III i/^i 1 A 

(jougaaacaug gag u ugua 


2089 


GUACGUACCAUGCAGAAUA 


2090 


GAACUUGCAUUGUGAUUGG 


2091 




Sense •, ■ . 




GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCCAGAGCGUGUA 


2093 


IGUUCUUAGUUGCUGUAUGU 


2094 


I CAUCAUGGUUGGCCUUAUC 


2095 




Sense 




GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


^GGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUCUCCUUGA 


2099 




; ■ . i Sense , '7^ '-.r 




CUACAGAUCUCCAAUGAAA 


2100 


GCUACAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA i 2102 


GGAGUUGGAUCUCUCAGAA i 2103 




; Sense 




GUAGUAAGCUAAACUGAAA 


2104 


GACAAUAUGUUCAAGAGAA 


2105 < 


GACAACAUCUCGUCAGCUA 


2106 


UAUAUGAUGUGUCCAGUAA 


2107 




' . • ■ v^Sense ^^?Jr ' :r ■'■ ■ 




CAAAGAAGCUCACUUACUG 


- 2108 


GAACAGCCCUAAAGUGGUU 


2109 


GAACAUGGAUGACCCUAAG 


2110 


Igggcaaagcuaaaucaagu 


2111 



HDAC3 



HDAC5 



HEC 



Sense? : = . 


i .:• ^ . 


ggaaagcgauguggagauu 


2112.. _ 


ccaagaccguggccuauuu 


2113 


aaagcgauguggagauuua 


i 2114 


gugaggagcuucccuauag 


_.__^1J5 




Sense 




gaauuccucuugucgaagu 


2116 


guuauuagcacguuuaaga 


2117 


ggagggaggccaugacuug 


2118 


caggagagcucaagaaugg 


2119 


ggauauggauuucaguuaa 


2120 


ggaagucggugccuugguu 


2121 


iggaaggagaggacugguuu 


2122 




Sense - . 




gcagauacuuggacgguuu 


2123 


gaguagaacuagaauguga 


2124 


gcgaauaaaucaugaaaga 


2125 


gaagauggaauuaugcaua 


2126 



HIST1H2 


Sense 




AA 


GGCAAUGCGUCUCGCGAUA 


2127 




GAUCCGCAAUGAUGAGGAA 




2128 






GCAAUGCGUCUCGCGAUAA 


2129 i 




GAGGAACUCAAUAAGCUUU 


2130 1 



LMNB1 



Sense 




AAUAG AAGCUGUGCAAU UA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCUUAAU 


2133 


i GGGAAGGGUUUCUCUAUUA 


2134 



LMNB2 



MYB 



MYT1 



NFKBIB 



• if uT' Senie. ':\ ' 1 




GGAGGUUCAUUGAGAAUUG ! 


2134 


GGCAAUAGCUCACCGUUUA 


2135 


CAAAUACGCUUAGCUGUGU 


2136 


GGAGAUCGCCUACAAGUUC 


2137 




■ ai:-' ■ . Serise^'-', ' '.s..-^': ; ■ 




GCAGAAACACUCCAAUUUA 


2138 


IGUAAAUACGUGAAUGCAUU 


2139 


j GCACUGAACUUUUGAGAUA 


2140 


1 GAAGAACAGUCAUUUGAUG 


2141 




■ rv i : Sense % . ' 




•■ GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


i 2144 


CAACUUCUCUCCUGAACUU 


! 2145 




■ ■ ■ •!?, •, Sense . ■ ^ ' ■ ■ .\. 


GGACACGGCACUGCAGUUG 


i 2146 


GCACUUGGCUGUGAUUCAU 


1 2148 


GAGACGAGGGCGAUGAAUA 


1 2149 


CAUGAAGCCUUCCUGGAUU 


1 2150 



NFKBIA 



■ •; • Sense ' . ':-.X i ' 


GAACAUGGACUUGUAUAUU 


i 21 


51 


GAUGUGGGGUGAAAAGUUA 


-1- 


52 


GGACGAGAAAGAUCAUUGA 


! 21 


53 


AGGACGAGCUGCCCUAUGA 


i 21 


54 



NFKBIE 



NUMA1 



NUP153 



Sense 




GAAGGGAAGUUUCAGUAAG 


2155 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAG 


2157 


i GAACCAACCACUCAUGGAA 


2158 




Sense 




GGGAACAGUUUGAAUAUAA 


2159 


GCAGUAGCCUGAAGCAGAA 


2160 


CGAGAAGGAUGCACAGAUA 


2161 


GCAAGAGGGUGAGAGGAAA 


i 2162 




Sense | 



143 



0PA1 



A A A A A A 1 1/^ AAA #^ A%t I A A 

GAAGACAAAUGAAAGCUAA 


2163 


' GAUAAAGACUGCUGUUAGA 


2164 


A A A 1 1/^1 lA Al lAI ti lA 

(j AoGACaAGC UCU AAU AU UA 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 




Sense 




GAAAGAGCAUGAUGACAUA 


2167 


1 GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


i AAAGAAGGCUGUACCGUUA 


2170 



PARVA 



Sense 




CUACAUGUCUUUGCUCUUA 


2171 


GCUAAGUCCUGUAAGAAUA 


2172 


CAAAGGCAAUGUACUGUUU 


2173 


GAACAAUGGUGGAUCCAAA 


2174 



PIK3CG 



PRKDC 



RASA1 



SLC9A1 



TEGT 



TERT 







AAGUUCAGCUUCUCUAUUA 


2175 


GAAGAAAUCUCUGAUGGAU 


! 2176 


GAACACCUUUACUCUAUAA 


2177 


GCAUGGAGCUGGAGAACUA 


2178 




1 Sense 




1 GAUGAAAGCUCUAAAGAUG 


2179 


1 GAAAGGAGGUUCUAAACUA 


2180 


1 GGAAGAAGCUCAUUUGAUU 


2181 


[gcaaagagguggcaguuaa , 


' 2182 




Sense 




GGAAGAAGAUGCACAUGAA 


2183 


GAACAUACUUUGAGAGCUU 


2184 


i GAACAAUCUUUGCUGUAUA 


2185 


1 UAACAGAACUGCUUGAACA 


2186 




^ ^"^ -vi' Sense ^ ■ '^"^^iilK-^^i 




\ GAAGAGAUCCACACACAGU 


2187 


1 UCAAUGAGCUGCUGCACAU 


2188 


1 GAAGAUAGGUUUCCAUGUG 


2189 


i GAAUUACCCUUCCUCAUCU 


2190 




;• .Sense 




CUAGAGAGCUUCAGUGUGA 


2191 


GAACAUAUUUGAUCGAAAG 


2192 


GAGCAAACCUAGAUAAGGA 


2193 


GCAUUGAUCUCUUCUUAGA 


2194 




Sense | 


GGAAGACAGUGGUGAAGUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


1 GAGCUGACGUGGAAGAUGA 


2197 


i GAACGGGCCUGGAACCAUA 


2198 



TNFRSF 
6 


Sense 






GAUACUAACUGCUCUGAGA 


2199 



140 



TOP1 



TOP2A 



T0P3A 



TOP3B 



WEE1 



GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCCCAUGUAA 


2201 


UCAUGAAUCUCCAACCUUA 


2202 


GAUGUUGACUUGAGUAAAU 


2203 




Sense : 




GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 1 


GGAAGUAGCUACGUUCUUU 


2206 


GG ACAU AAG UGG AAAG AAG 


2207 




.' Sense 




GAAAGAGUGCAUCAGAUUU 


2208 


CAAACUACAUUGGCAUUUA 


2209 ; 


AAACAGACAUGGAUGGAUA 


2210 j 


CGAAAGGAAUGGUUAACUA 


2211 




■'^^ : ' '1 Sense . ' V .. 




CCAGAAAUCUUCCACAGAA 


2212 


GAAACUAUCUGGAUGUGUA 


2213 i 


CCAGAAAGAUGGUAUCGUA 


2214 


GGAAAUGGCUGUGGUAAGA 


2215 




S^hia "Sense -•. 




GAGACAAGAUGAAGACUGU 


2216 


GCACAUGGGCUGCGUCUUU 


2217 


CCAGUGCGCUUCAAGAUGA 


2218 


! GAAGAUCUGCUUUGAGGUU 


2219 




"■■■^Hik: ■ Sense . 




GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


2222 


GGUGUAAACUUGUAGCAUU 


2223 



In addition, to identifying functional siRNA against gene families or 
patliways, it is possible to design duplexes against genes known to be involved in 
5 specific diseases. For example when dealing with human disorders associated with 
allergies, it will be beneficial to develop siRNA against a number of genes including 
but not limited to: 
the interleukin 4 receptor gene 

(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU, 
10 SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA, 
SEQ. IDNO. 2225: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 



the Beta-arrestin-2 



14/ 

(SEQ. ID NO. 2227: GGAUGAAGGAUGACGACUA, 
SEQ. ID NO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. ID NO. 2230: GAUGAAGGAUGACGACUAU, ), 

5 

the interferon-gamma receptor 1 gene 
(SEQ. ID NO. 2231: CAGCAUGGCUCUCCUCUUU, 
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA, 
SEQ. ID NO. 2233: GAAACUACCUGUUACAUUA, 
10 SEQ. ID NO. 2234: GAAGUGAGAUCCAGUAUAA), 

the matrix metalloproteinase MMP-9 

(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. ID NO. 2236: GUUGGAGUGUUUCUAAUAA, 
1 5 SEQ. ID NO. 2237: GCGCUGGGCUUAGAUCAUU, 
SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA), 

the Slcl lal (Nrampl) gene 

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA, 
20 SEQ. ID NO. 2240: GGGCCUGGCUUCCUCAUGA, 
SEQ. ID NO. 2241: GGGCAGAGCUCCACCAUGA, 
SEQ. ID NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPINK5 

25 (SEQ. ID NO. 2243: CCAACUGCCUGUUCAAUAA, 
SEQ. ID NO. 2244: GGAUACAUGUGAUGAGUUU, 
SEQ. ID NO. 2245: GGACGAAUGUGCUGAGUAU, 
SEQ. ID NO. 2246: GAGCUUGUCUUAUUUGCUA,), 

30 Ae CYP1A2 gene 

(SEQ, ID NO. 2247: GAAAUGCUGUGUCUUCGUA, 
SEQ. ID NO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. ID NO. 2249: GAAGACACCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGCUUGACCUUCA), 



tiiymosm-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU, 
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA, 
5 SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU, 
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG, 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
[0 SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC, 
SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
1 5 SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA, 
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

20 

ADAM33 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA, 
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA, 
SEQ. ID NO. 2266: GGUGAGAGGUAGCUCCUAA, 
25 SEQ. ID NO. 2267: AAAGACAGGUGGCCACUGA), 

theTAPl gene 

(SEQ. ID NO. 2268: GAAAGAUGAUCAGCUAUUU, 
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA, 
30 SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU, 
SEQ. ID NO. 2271 : UACCUUCACUCGAAACUUA, 

COX-2 

(SEQ. ID NO. 2272: GAACGAAAGUAAAGAUGUU, 
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SEQ. IDNO. 2273: GGACUUAUGGGUAAUGUUA, 
SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

5 ADPRT 

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU, 
SEQ. IDNO. 2277: CAUGGGAGCUCUUGAAAUA, 
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG, 
SEQ. ID NO. 2279: UGAAGAAGOJCACAGUAAA, ), 

10 

HDC 

(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA. 
SEQ. ID NO. 2281 : GGAGAGAGAUGGUGGAUUA, 
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA, 
1 5 SEQ. ID NO. 2283 : GAACGUCCCUUCAGUCUGU), 

HnmT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
20 SEQ. ID NO. 2286: GAGCAGAGCUUGGGAAAGA, 
SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. IDNO. 2288: GAACUGCUUUCUUUCGUUU, 
25 SEQ. ID NO. 2289: GCAGUAUCAUGAAGCCUAA, 
SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

30 (SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU, 
SEQ. ID NO. 2293: GGACUUAGAUGCCCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 2295: GGACUAGCCCUGCUGUUUA), and 



STAT6 

(SEQ. IDNO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. TD NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
5 SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
genes involved in anemia, hemopMla or hypercholesterolemia. Such genes would 
include, but are not be limited to: 
10 APOA5 

(SEQ. ID NO. 2300: GAAAGACAGCCUUGAGCAA, 
SEQ. ID NO. 2301 : GGACAGGGAGGCCACCAAA, 

SEQ. ID NO. 2302: GGACGAGGCUUGGGCUUUG, 
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU), 

15 

HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ID NO. 2306: UAUCAGAGCUCUUAAUGUU, 
20 SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA), 

N0S3 

(SEQ,. ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
25 SEQ.IDNO.2310:GGAAGAAGACCUUUAAAGA, 
SEQ. IDNO. 2309: GCACAAGAGUUAUAAGAUC), 

ARH 

(SEQ. ID NO. 23 1 0: CGAUACAGCUUGGCACUUU, 
30 SEQ. ID NO. 23 1 1 : GAGAAGCGCUGCCCUGUGA, 
SEQ. IDNO. 2312: GAAUCAUGCUGUUCUCUUU, 
SEQ. ED NO. 2313: GGAGUAACCGGACACCUUA), 



CYP7A1 



(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
SEQ. ID NO. 2315: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 

5 

FANCA 

(SEQ. ID NO. 23 1 8: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
10 SEQ. ID NO. 232 1 : GGACAGAGGCAGAUAAGAA), 

FANCG 

(SEQ. ID NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. ID NO. 2323: GCAAGCAGGUGCCUACAGA, 
15 SEQ. ID NO. 2324: GGAAUUAGAUGCXJCCAUUG, 
SEQ. ID NO. 2325: GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: CAAUAUGCCUGGAAACUAU, 
20 SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC,: * 
SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 

PIGA 

25 (SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA, 
SEQ. ID NO. 2331: GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

30 factor VIII 

(SEQ. ID NO. 2334: GCAAAUAGAUCUCCAUUAC, 
SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 
SEQ. ID NO. 2336: GAAAGGCUGUGCUCUCAAA, 
SEQ. ID NO. 2337: GGAGAAACCUGCAUGAAAG, 



SEQ. ID NO. 2338: CUUGAAGCCUCCUGAAUUA, 
SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 
SEQ. ID NO. 2340: GAUAGGAGAUACAAACUUU). 

5 Fiirthennore, rationally designed siRNA or siRNA pools can be directed 

against genes involved in disordCTS of the brain and nervous system. Such genes 
would include, but are not be limited to: 
APBBl 

(SEQ. ID NO. 2341: CUACGUAGCUCGUGAUAAG, 
10 SEQ. n> NO. 2342: GCAGAGAUGUCCACACGUU, 
SEQ. ID NO. 2343: CAUGAGAUCUGCUCUAAGA, 
SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG), 

BACEl 

1 5 (SEQ. ID NO. 2345 : CCACAGAGCAAGUGAUUUA, 
SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA, 
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA, 
SEQ. ID NO. 2348: UGUCAGAGCUUGAUUAGAA), 

20 PSENi .-.y 

(SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG, 
SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA, 
SEQ. ID NO. 2351: GAGGUUAGGUGAAGUGGUU, 
SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA, 
25 SEQ. ID NO. 2353: GAACUGGAGUGGAGUAGGA, 
SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 
SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 

PSEN2 

30 (SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA, 
SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUA, 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA), 



i3J 

MASSl . 

(SEQ. ID NO. 2360: GGAAGGAGCUGUUAUGAGA, 
SEQ. JD NO. 2361: GAAAGGAGAAGCUAAAUUA, 
SEQ. E) NO. 2362: GGAGGAAGGUCAAGAUUUA, 
5 SEQ. ID NO. 2363: GGAAAUAGCUGAGAUAAUG, ), 

ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
10 SEQ.IDN0.2366:CGCCUGAUAUUGAAGUAAA, 
SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and 

NNMT 

(SEQ. ID NO. 2368: GGGCAGUGCUCCAGUGGUA, 
15 SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA, . 
SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA, 
SEQ. ID NO. 2371: GAGGUGAUCUCGCAAAGUU). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
20 genes involved in hypertension and related disorders. Such genes would include, but 
are not be limited to: 

angiotensin II type 1 receptor 

(SEQ. ID N0..23.72: CAAGAAGCCUGCACCAUGU, ' 
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA, 
25 SEQ. ID NO. 2374: GCACUGGUCCCAAGUAGUA, 
SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 

30 

AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID NO. 2381: GCAAGCAUCUUAUAUAGUU, 



1^4 

SEQ. ID NO. 2382: GAACCAGUOJUIJCAACUCA), and other related targets. 



Example XIII: Validation of Multigene Knockout using RabS and Eps 

Two or more genes having similar, overlapping functions often leads to 
genetic redundancy. Mutations that knockout only one of, e.g„ a pair of such genes 
(also referred to as homologs) results in little or no phenotype due to the fact that the 
remaining intact gene is capable of MfiUing the role of the disrupted counterpart. To 
folly understand the fonction of such genes in cellular physiology, it is often 
necessary to knockout or knockdown both homologs simultaneously. Unfortunately, 
concomitant knockdown of two or more genes is firequently difficult to achieve in 
higher organisms {e.g, mice) thus it is necessary to introduce new technologies dissect 
gene fiinction. One such approach to knocking down multiple genes simultaneously 
is by using siRNA. For example, Figure 11 showed that rationally designed siRNA 
directed against a number of genes involved in the clathrin-mediated endocytosis 
pathway resulted in significant levels of protein reduction (e.g. >80%). To determine 
the effects of gene knockdown on clathrin-related endocytosis, internalization assays 
were perfonned using epidermal growth factor and transferrin. Specifically, mouse 
receptor-grade EGF (Collaborative Research Lie.) and iron-saturated human 
transferrin (Sigma) were iodinated as described previously (Jiang, X., Huang, F., 
Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 14, 858-70). HeLa- cells grown in 12- 

well dishes were incubated with ^^^I-EGF (1 ng/ml) or I-transferrin (1 jxg/ml) in 
binding medium (DMEM, 0.1% bovine semm albumin) at 37°C, and the ratio of 
internalized' and surface radioactivity was determined during 5-min time course to 
calculate specific intemalization rate constant ke as described previously (Jiang, X et 
al). The measurements of the uptakes of radiolabeled transferrin and EGF were 
performed using short time-course assays to avoid influence of the recycling on the 
uptake kinetics, and using low hgand concentration to avoid saturation of the clathiin- 
dependent pathway (for EGF Lund, K. A., Opresko, L. K., Strarbuck, C, Walsh, B. J. 
& Wiley, H. S. (1990)/ Biol Chem. 265, 15713-13723). 

The effects of knocking down Rab5a, 5b, 5c, Eps, or Eps 15R (individually) 
are shown in Figure 22 and demonstrate that disruption of single genes has little or no 
effect on EGF or Tfii intemalization. In contrast, simultaneous knock down of Rab5a, 
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5b, and 5c, or Eps and Bps 15R, leads to a distinct phenotype (note: total 
concentration of siRNA in these experiments remained constant with that in 
experiments in which a single siRNA was introduced, see Figure 23). These 
experiments demonstrate the effectiveness of using rationally designed siRNA to 
5 knockdown multiple genes and validates the utility of these reagents to override 
genetic redundancy. 

Example XTV, Validation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

1 0 Further demonstration of the ability to knock down expression of multiple 

genes using rationally designed siRNA was performed using pools of siRNA directed 
against four separate genes. To achieve this, siRNA were transfected into cells (total 
siRNA concentration of 1 OOnM) and assayed twenty-four hours later by B-DNA. 
Results shown in Figure 24 show that pools of rationally designed molecules are 

1 5 capable of simultaneously silencing four different genes. 

Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 
Expression Profiling, a Prophetic Example 

To further demonstrate the ability to concomitantly knockdown the expression 
20 of multiple gene targets, single siRNA or siRNA pools directed against a collection of 
genes (e.g. 4, 8, 16, or 23 different targets) are simultaneously transfected into cells 
and cultured for twenty-four hours. Subsequently, mRNA is harvested from treated 
(and untreated) cells andJabeled with one of two fluorescent probes dyes (e.g. a red 
fluorescent probe for the treated cells, a green fluorescent probe for the control cells.). 
25 Equivalent amoimts of labeled RNA fiom each sample is then mixed together and 
hybridized to sequences that have been linked to a solid support (e.g, a slide, "DNA 
CHIP"). Following hybridization, the slides are washed and analyzed to assess 
changes in the levels of target genes induced by siRNA. 

30 Example XVI. Identifying Hyperfunctional siRNA 



Identification of Hyperfunctional Bcl-2 siRNA 

The ten rationally designed Bcl2 siRNA (identified in Figure 13, 14) were 
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 
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siRNA were individually transfected into cells at a 300pM (0.3nM) concentrations. 
Twenty-four hours later, transcript levels were assessed by B-DNA assays and 
compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2 
siRNA Mled to induce functional levels of silencing at this concentration, siRNA 1 
and 8 induced >80% silendng, and siRNA 6 exhibited greater than 90% silendng at 
this subnanomolar concentration. 

By way of prophetic examples, similar assays could be performed with any of 
the groiqps of rationally designed genes described in Example VII or Example VIII. 
Thus for instance, rationally designed siRNA sequences directed against 
PDGFA 

(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGGAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or 

PDGFB 

(SEQ. ED NO. 2387: GCUCCGCGCUUUCCGAUUU, 

SEQ. ED NO. 2388: GAGCAGGAAUGGUGAGAUG, 

SEQ. ID NO. 23 89: GAACUUGGGAU AAGAGUGU, .: •. 

SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 

SEQ. ID NO. 2391: UUUAUGAGAUGCUGAGUGA) 

could be introduced intei^ells at inareasingly limiting concentrations to determine 
whether any of the duplexes are hyperfimctional. Similarly, rationally designed 
sequences directed against 
HIFl alpha 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ID NO. 2393: GCAUAUAUCUAGAAGGUAU, 
SEQ. ID NO. 2394: GAACAAAUACAUGGGAUUA, 
SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or 

VEGF 

(SEQ. ID NO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID NO. 2397: GAGAAAGCAUUUGUUUGUA, 
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SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 

SEQ. ID NO. 2399: CGAGGCAGCUUGAGUUAAA) could be introduced into cells 
at increasingly limiting concentrations and screened for hyperfunctional duplexes. 

Example XVII: Gene Silencine: Prophetic Example 

Below is an example of how one might transfect a cell. 

a. Select a cell line. The selection of a cell line is usually determined by the 
desired application. The most important feature to RNAi is the level of 
expression of the gene of interest. It is highly recommended to use cell lines 
for which siRNA transfection conditions have been specified and validated. 

b. Plate the' cells. Approximately 24 hours prior to transfection, plate the cells at 
the appropriate density so that they will be approximately 70 - 90% confluent, 
or approximately 1x10^ cells/ml at the time of transfection. Cell densities 
that are too low may lead to toxicity due to excess exposure and uptake of 
transfection reagent-siRNA complexes. Cell densities that are too high may 
lead to low transfection efficiencies and little or no silencing. Incubate the 
cells overnight. Standard incubation conditions for manamalian cells are 37°C 
in 5% CO2. Other cell types, such as insect cells, require different 
temperatures and CO2 concentrations that are readily ascertainable by persons 
skilled in the art. Use conditions appropriate for the cell type of interest. 

c. SiRNA re-suspension^^dd 20 |il siRNA universal buffer to each siRNA to 
generate a final concentration of 50 |xM. 



d. SiRNA-lipid complex formation. Use RNase-firee solutions and tubes. Using 
the following table. Table XI: 
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Table XI 




96-well : 24-well 


Mixture 1 (TransIT-TKO-PIasinid dilution mixture) 


Opti-MEM 


9.3 Hi 


46.5 111 


TransIT-TKO (1 ng/nl) 


0.5 Hi 


2.5 Hi 








Mix^tiure 1 iE^ai Yqlumi:^ . 


io:oui 


' 50.0 fU 



Mixture 2 (siRNA dilution mixture) 


Opti-MEM 


9.0 id 


45.0 Hi 


siRNA(l |iM) 


1.0 Hi 


5.0 Hi 


■Oiixtur^ ,2 Final Voliune 


:^:,.10.0h1- 


-50.0 Hl-i^; 




Mixture 3 (siRNA-Transfection reagent mixture) 


Mixture 1 


10 Hi 


50 H) 


Mixture 2 


10 Hi 


50 Hi 


.'SMix^re 3 fiihal %Iurtie' 


If 20^ ^i. 






Incubate 20 minutes at room temperature. 


Mixture 4 (Media-siRNA/Transfection reagent mixture) 


Mixtures 


20 Hi 


100 Hi 


Complete media 


80 Hi 


400 Hi 


: Ml3^tiire4':Finai Volu^. ...v 


iOOHlS 


500. jii' 




Incubate 48 hours at 3TC. 



5 Transfection . Create a Mixture 1 by combining tlie specified amounts of OPTI-MEM 
serum free media and transfection reagent in a sterile polystyrene tube. Create a 
Mixture 2 by combining specified amounts of each siRNA with OPTI-MEM media in 
sterile 1 ml tubes* Create a Mixture 3 by combining specified amounts of Mixture 1 
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 

10 minutes. Create a Mixture 4 by combining specified amounts of Mixture 3 to 

complete media. Add appropriate volume to each cell culture well. Incubate cells 
with transfection reagent mixture for 24 - 72 hours at 3TC. This incubation time is 
flexible. The ratio of silencing will remain consistent at any point in the time period. 
Assay for gene silencing using an appropriate detection method such as RT-PCR, 

1 5 Western blot analysis, immunohistochemistry, phenotypic analysis, mass 



spectrometry, fluorescence, radioactive decay, or any other method that is now known 
or that comes to be known to persons skilled in the art and that from reading this 
disclosure would useful with the present invention. Tl;e optimal window for 
observing a knockdown phenotype is related to the mRNA turnover of the gene of 
5 interest, although 24 ~ 72 hours is standard. Final Volume reflects amount needed in 
each well for the desired cell culture format. When adjusting voftHnes for a Stoc^c' 
Mix, an additional 10% should be used to accommodate variability in pipetting, eta 
Duplicate or triplicate assays should be carried out when possible. 

10 While the invention has been described in connection with specific 

embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
invention following, in general, tlie principles of the invention and including such 
departure firom the present disclosure as come within known or customary practice 

15 within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth and as follows in the scope of the upended claims. 
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A method for selecting siRNA comprising selecting an siRNA molecule of 
19-25 nucleoside bases, said method comprising: 

(a) selecting a target gene; 

(b) measuring the functionality of sequences of 19 - 25 nucleotides in 
length that are substantially complementary to a stretch of nucleotides 
of the target sequence, wherein said functionality is dependent upon 
non-target specific criteria. 

2. The method according to claim 1 wherein said functionality is detemiined 
by applying one of the following formulas: 

Formula I = -(GC/3) +(AUi5.i9) -(Tm2o°c)*3 -<G,3)*3 -(€,9) +(A,9)*2 +(A3) 
+(Uio)+(A,4)"(U5)-(An); 

Formula H = -(GC/3) -(AUi5-i9)*3 -<Gi3)*3 -(€,9) +(Ai9)*2 +(A3); 
Foraiula III = -(GC/3) -KAU15.19) -(Tm20'>c)*3; 

Formula IV = -(GC/2)+(AUi5-i9)/2-(Tm2ooc)*2 -<Gi3)*3 -(C19) +(Ai9)*2 
+(A3) +(U,o)+(Ai4)-(U5)-(An); 

Foraiula V = -(Gi3)*3 -(Gi9)..i^(Ai.9)*2 +(A3) + (UioMAm) -(U5) -(A,i); 
Formula VI = -(Gi3)*3 -(C19) +(Ai9)*2 +(A3); 

Formula Vn = ~(GC/2) +(AUi5.i9)/2 -( Tm2o«c)*l -(G,3)*3 -(C19) +(Aj9)*3 
+(A3)*3 +(Uio)/2+(Ai4)/2«{U5)/2-(A„)/2; 

wherein in Formulas I - VII: 



Claims 
1. 



AU15.19 = 0-5 depending on the number of A or U bases on the sense 
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strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 
value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 
value is 0; 

GC= the number of G and C bases in the entire sense strand; 
Tm 2o°c= 1 if the Tm is greater than 20^C; 

A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

All = 1 if A is the base at position 1 1 on the sense strand, otherwise its 
value is 0; 

Ai4= 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Uio= 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 



Formula VIII: (-14)*Gi3-13*Arl2*Ug^l4m^J0*Airl0*U4-10* 
9*Aio-9*U9-9*Ci8-8*Gio"7*Ur7*Ui6-7*Ci7-7*Ci9 
+7*Ui7+8*A2+8* A4 +8*A5+8*C4 +9*G8 +1 0* A7+1 0*Ui8+l 1 * A19+ 
ll*C9+15*Gi+ 18*A3+19*Uio-Tm-3* (GCtatai)-6*(GCi5-i9> 
30*X;and 

Formula DC: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*Uio+(10.5)* 
C9-K23.9)*G,-K163)'K}2+(-123)*An+(-193)*Ui+^ 
(-1 l)*U3+("15.2)*Ui5+(-l 1 .3)*Ui6+ (-1 1.8)*C3+(-17.4)*C6-K- 

10.5)*C7+ (-13.7)*Gi3-K-25.9)*Gi9-Tm-3* (GQotai)-6*(GCi5-i9> 

30*X 

wherein 



Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4- 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
As = 1 if A is the base at position 5 of the srase strand, otherwise its value is 0; 
A6= 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio = 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
All = 1 if A is the ^ase at position 1 1 of the sense strand, otherwise its value is 0; 
Ai3 = 1 if A is the base at position 1 3 of the sense strand, otherwise its value is 0; 
Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, oth^wise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Ci7 ^ 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 
C 19 = 1 if C is the base at position 19 of the sense strand, otherwise if anotlier base 
is present or the sense strand is only^^&base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the satise strand, otherwise its value is 0; 
G2= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
G 19 = 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
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Us = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
5 Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

10 

GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 

GQota! = the number of G and C bases in the sense strand; 

Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 
1 5 otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a 
row. 



3. A method of gene-silencing comprising selecting an siRNA according to 
20 . claim 2 and introducing it into a cell. 

4. The method according to claim 3 wherein said introducing is by allov^ing 
passive uptake of the siRNA. 



25 5. The method according to claim 3, wherein said introducing is through flie 
use of a vector. 



6. A method for developing an siRNA algorithm for selecting siRNA, said 
method comprising: 
30 (a) selecting a set of siRNA; • 

(b) measuring the gene silencing ability of each siRNA from said set; 

(c) determining the relative functionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 



the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at 
a particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 
(e) developing an algorithm using the information of step (d). 

A method of selecting an siRNA with improved functionality, said method 
comprising using the algorithm of claim 6. 

A method of selecting hyperfonctional siRNA, said method comprising 
using at least one functional siRNA, wherein at least one said functional 
siRNA has been selected according to the method of claim 7 and 
measuring the silencing ability of said at least one functional siRNA, 
wherein silencing ability is measured at a concentration of less than 1 
nanomolar siRNA, 

An siRNA molecule, wherein said siRNA molecule is effective at 
silencing Bcl-2. 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sequence substantially similar to a sequence selected from the group 
consisting of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQv I!i)i;fel£L 302); 
GUACGACAACCGGGAGAUA (SEQ. IDNO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
CAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

1 1 . The siRNA molecule of claim 1 0, wherein said siRNA molecule 
comprises a sequence selected from the group consisting of 
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GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ED NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 3 1 0). 

The siRNA molecule of claim 11, '•wherein said siRNA molecule 
comprises GCAUGCGGCCUCUGUUUGA . 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sense strand and an anti-sense strand. 

The siRNA molecule of claim 9, wherien said siRNA molecule comprises 
a hairpin. 

The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
between 1 8 and 30 base pairs. 

A kit for gene silencing comprising at least one siRNA selected from the 
group consisting of sequences substantially similar to the group consisting 
of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 



lOO 

GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

17. A method of gene silencing comprising using the siRNA molecule of 
claim 10. 

1 8. A method of gene silaicing comprising using the siRNA molecule of claim 
11. 

19. A kit, wherein said kit is comprised of at least two siRNA, whwein said at 
least two siRNA comprise a JBrst optimized siRNA and a second optimized 
siRNA, wherein said first optimized siRNA and said second optimized 
siRNA are optimized according to one of the following formulas: 

Formula I - -(GC/3) +(AUi5-i9) -(Tm2o-c)*3 -<Gi3)*3 -(Cg) +(A,9)*2 +(A3) 
+(U,o)+(A,4)-(U5)-(An); 

Formula II = -(GC/3) -(AU,s-i9)*3 -iGii)*3 -(Cw) -KAi9)*2 -KA3); 
Formula III = -(GC/3) +(AUi5-i9) -(Tm20''c)*3; 

Formula IV = -(GC/2)+(AU,5-i9)/2-(Tm2o»c)*2 -(Gi3)*3 -(C19) +(A|9)*2 
+(A3) +(U,o)+(Ai4)-(U5)-(Aii); ■ifc^.w 

Formula V = -(G,3)*3 -(C,9) +(Ai9)*2 +(A3) + (Uio)+(Ai4) -{Us) -(A, ,); 

Formula VI = -(G,3)*3 -(C19) +(Ai9)*2 +(A3); 

Formula VII = -(GC/2) +(AUis-i9)/2 -( Tmao-c)*! -(Gi3)*3 -(C19) +(Ai9)*3 
+(A3)*3 +(U,o)/2+(Ai4)/2 -<U5)/2 -<An)/2; 



wherein in Formulas I - VII: 



10/ 



AUi5-i9 - 0 - 5 depending on the number of A or U bases on the sense 

strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 
5 value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 

value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 2o«c = 1 if the Tm is greater than 20°C; 
10 A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 

value is 0; 

An = 1 if A is the base at position 1 1 oh the sense strand, otherwise its 
value is 0; 

Ah= 1 if A is the base at position 14 on the sense strand, otherwise its 
15 value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

20 Uio= lif U is the base at position 10 on the sense strand, otherwise its 

value is 0; 

or, 

25 Formula VIII: (-14)*Gi3-13*Ai-12*U7-l l*U2-10*Aii-10*U4-10*C3-10*C5.10*C6- 
9*Aio-9*U9-9*C,8-8*Gio-7*U,-7*U,6-7*Cn-7*Ci9 
+7*U,7+8*A2+8*A4+8*A5+8*C4+9*G8+10*Arfl0*U,8+ll*Ai^^ 
ll*C9-M5*Gi+ 18*A3+19*U,o-Tm-3* (GCtouO-6*(GCi5~i9)- 
30*X; and 

30 

Formula IX: (14.1)*A3-K14.9)*A6+(17.6)*Ai3+(24 J)*Ai9-K14.2)*^^ 

C9+(23.9)*Gi+(163)*G2+(-123)*Aii+(-193)'^,H-(.12J)*^^^ 



IDO 

(-ll)*U3+(-152)*Ui5+(-113)*Ui6+(-11.8)*C3+(-17.4)*^^^ 
10.5)*C7+ (-13.7)*Gi3+(-25.9)*G,9^Tin--3* (GQotai) - 6*(GCi5-i9)- 
30*X 

wherein 

5 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

A5 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
10 A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

Ay = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio = 1 if A is the base at position 10 of the sense strand; otherwise its value is 0; 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
15 Ai9 = 1 if A is the base at position 1 9 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 



C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
20 C5 = 1 if C is the base atposition 5 of the sense strand, othawise its value is 0; 

C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 

Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
25 C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, oth^wise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 



G| = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
30 G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
G8= 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio = 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 



Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
5 U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
10 Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otiherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

15 

GCi5^i9 = the number of G and C bases within positions 1 5 - 19 of the sense strand 

or within positions 15 -18 if the sense strand is only 18 base pairs in length; 
GCtotai = the number of G and C bases in the sense strand; 
Tm = 1 00 if the targeting site contains an inverted repeat longer then 4 base pairs, 
20 otherwise its value is Q; iand 

X = the number of times that the same nucleotide repeats four or more times in a row. 
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High-throughput genomic technologies are 

revolutionizing modern biology. In particular, DNA 

microarrays have become one of the most povs^erfiil 

tools for profiling global mRN A expression in different 

tissues and environmental conditions, and for detecting 
single nucleotide polymorphisms. The broad 

applicability of gene expression profiling to the 

biological and medical realms has generated expanding demand for mass production of microarrays, 

which in turn has created considerable interest in improving the cost effectiveness of microarray 

fabrication techniques. We have developed the computational framev^ork for an optimal synthesis 

strategy for oligonucleotide microarrays. The problem was introduced by Hubbell et al Here, we 

formalize the problem, obtain precise boimds on its complexity and devise several computational 

solutions. 
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[> MTROBUCTION 

Oligonucleotide and cDNA microarrays can monitor 
mRNA expression levels for tens of thousands of genes 
simultaneously (!)• While both types of arrays are 
£qpplied to the elucidation of normal and pathological 
cellular mechanisms, the longer probes on cDNA arrays 
make them more susceptible to cross-hybridization, and 
oligo arrays are designed to reduce cross-hybridization 
and improve sensitivity (2). In addition, oligo microarrays can be used to detect polymorphisms (3) and, 
therefore, can greatly facilitate the research and diagnosis of genetic predisposition to diseases. Several 
large companies, such as Affymetrix, Coming, Motorola and Samsung, have established or are 
establishing the capability of manufacturing microarrays in large quantities. Thus, reductions in cost or 
time to manufacture can have a significant unpact on biotechnology and medicine. 

cDNA microarrays are produced by spotting pre-made cDNA solutions onto a glass or nylon surface via 
physical contact or ink-jet deposition. While oligonucleotides can also be synthesized and then spotted, 
oligo microarrays are usually manufactured by in situ synthesis, primarily via photolithography (4), and 
more recently by ink-jet deposition (5). In situ synthesis involves the consecutive addition of A, C, G 
and T nucleotides to the appropriate spots on the microarray. An important advantage of 
photolithographic synthesis over ink-jet deposition is that in a single cycle of synthesis, a nucleotide can 
be added to all desired spots on the array. This is achieved via photodeprotection of the target spots on 
tiie array surface with UV light prior to the addition of the nucleotide. Meanwhile, moM-ltargeit spots 
must be protected from the UV light using physical or virtual masks. The fabrication of physical masks 
is a laborious and costly process and one mask is needed for each cycle of ssoidiesis for each variety of 
arrays. Singh-Gasson et al (^) used a digital micro-mirror device to reflect light selectively onto the 
desired spots of an array, the 'virtual masking' strategy. Nonetheless, the deprotection step for each 
cycle lasts ~5 min and photolabile nucleosides are expensive. Therefore, decreasing the number of 
cycles required to synthesize a given set of sequences can reduce time and cost. Here, we address 
synthesis optimization, i.e., optimizing the order of nucleotide addition. 

The simplest strategy for synthesizing a given set of sequences is to add A bases wherever appropriate as 
the first base, then C, G and T bases, repeating this process for the second base, and so on. Chee et al 
(3) noted that if K is the length of the longest oligonucleotide to be synthesized, maximally 4ir cycles are 
required. Hubbell et al (7) observed that it would be possible to skip a synthesis cycle if a base is not 
needed by any oligonucleotides, or if the oligonucleotides that require the base can still be synthesized 
when that base is presented again later. In a parallel publication, Tolonen et al (8) observe that synthesis 
could be accelerated, even for a large set of oligonucleotides, if the order of base addition is tailored to 
the oligonucleotide sequences. Consequently, oligonucleotides can vary in length by more than one base 
at the end of every synthesis cycle. This observation has motivated the development of the optimal base 
addition strategy described in this paper. 
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A COMPUTATIONAL FRAMEWORK FOR OPTIMAL 
SYNTHESIS STRATEGY 



We formulate the question of devising an optimal 

synthesis strategy in oligo microarrays as a 

combinatorial state space search problem. This 
computational fomixdation provides insight into the 

complexity of the problem and enables a range of 

discrete optimization and heuristic search solutions. 
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In this paper, we assume that the input to the optimization software is a collection of A/^ oligo sequences 
of arbitrary length, which have been pre-selected in a probe selection process. For simplicity, we discuss 
the case of uniform length ATmers, but our framework is readily applicable to the more general case. An 
optimal synthesis strategy involves L cycles of synthesis where, in each cycle, a single and identical 
nucleotide is added to all unmasked oligos. The exact spatial location of each oligo on the array is not 
important, as long as it can be retrieved during actual synthesis. Therefore^ we assume that the input to 
the optimization code is a list of oligos arranged in one dimension^ such as shown in Figure lA. We 
define a strategy for constructing Kmers in L cycles as an I long vector 5, consistmg of elements A, C, G 
G and T. S[j] is the nucleotide added in cycle y . For instance, the vector [A,C,A,T,G] corresponds to 
using A, C, A, T and G in cycles 1-5, respectively. We define the height of each partially constructed 
oligo as the number of nucleotides that have been added thus far by the synthesis strategy. We define a 
frontier F of a partially synthesized array to be an integer vector of size where F[i] is the height of the 
/th oligo thus far. For example, the frontier of the four oligos in Figure IB is [3,1,1,2]. 
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Figure 1. Formulation of synthesis strategy. (A) Four 3mer 
oligonucleotides. (B) Four partially constructed oligos, 
defining frontier F= [3,1,1,2]. (C) The synthesis strategy 
[A,C,G,T] can synthesize the array in four cycles, instead of 
six cycles reqmred by the traditional approach. 



The 'traditionaP way to create a chip of N oligos, each of height K, is to perform 4K cycles of synthesis. 
After the addition of A, C, G and T to the appropriate spots of the array, all oligos will be one base long. 
We then proceed to synthesize layer two in four cycles and all oligos will be two bases long. We 
continue until the entire chip is synthesized in cycles. It is easy to observe that by a slight 
modification of the order of base addition (8), we can expedite the above process. As a simplified 
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example (Fig- iC), we can synthesize an array of 3 in four cycles using a modified synthesis 
strategy, compared to six cycles with the 'traditional' approach. 

In order to introduce the optimization framework for masking, we need to measure the 'work' that has 
been accomplished after several cycles of synthesis. We therefore give two definitions of 'frontier 
height' : (i) the min height of a frontier constructed after L cycles is the length of the shortest oligo [in 
Fig. IB, min height (I) - 1]; (ii) the sum height of a frontier is the sum of the lengths of all oligos 
constructed so far [in Fig. IB, sum height (I) = 3 + 1 + 1 + 2 = 7]. 

The objective optimization criterion we desire to minimize is the number of cycles required to create a 
frontier of min height - K, i.e., we seek the shortest length strategy vector that is sufficient to synthesize 
all oligos on the chip. It is obvious that the best possible strategy for a ^er oligo chip is of length 
between K and AK. It is easy to construct an example where the shortest strategy is of length AK (Fig. 
2A), although genomic sequences typically do not exhibit such extremely low complexity. In general, as 
the number of oligos on a chip grows, the length of the optimal strategy vector is expected to grow as 
well. 

Figure 2. Example arrays that challenge synthesis strategies. 
(A) A worst-case scenario requiring AK stages for Ainer 
synthesis. (B) The optimal solution for this chip requires 9 
cycles whereas the greedy solution produces a 12-cycle 
strategy. However, the sum height-based greedy solution 
produces the optimal synthesis strategy. (C) For this example, 
sum height heuristics create an 8-cycle strategy: 
AATAATAA. The exhaustive search produces a 7-cycle 
strategy: ATAATAA. 

► HEURISTIC SEARCH SOLU TIONS 

An obvious heuristic solution to devising the optimal 

synthesis strategy is a greedy search. In each synthesis 

cycle, we consider the four different options to extend 

the current layout and compute the height of the 

resulting frontier for each option. We choose the 

nucleotide that maximizes the height of the frontier, 
which could be either min height or sum height. The 

mm height heuristic extends the shortest oligo, while the sum height heuristic chooses the nucleotide that 

will add the largest number of nucleotides to the chip. 

The simple examples in Figure 2B and C show that a greedy search is not guaranteed to produce an 
optimal solution using either the min height or the sum height heuristic. In the remainder of this section, 
we consider ways to improve the greedy search. In the next section, we show that an efficient 
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polynomial time solution is unlikely to exist for this optimization problem. We then report simulation 
results that describe the effectiveness of the sum height heuristics. 

Look-ahead solution 

A natural way to extend the greedy algorithm is to consider a look-ahead strategy, (i) Generate all 
possible frontiers that can be generated in L cycles. The number of strategies is 4^. The number of 
frontiers might be smaller since different strategies may generate the same frontier, (ii) For each frontier, 
compute the height, (iii) Choose (for the first cycle) the strategy that maximizes the height after L cycles, 
(iv) Repeat until all oligos have been synthesized. 

When L = AKy this algorithm performs an exhaustive search. A rough upper bound on the running time 
of this algorithm is O {^N), which makes it prohibitive for large values of An alternative approach 
would be to use a variant of best-first search such as A"^, a popular algorithm m the artificial intelligence 
community. A more space-efficient alternative would be to use a branch-and-bound formulation, a 
standard approach in discrete optimization. 

Local search solution 

A local search attempts to improve a given solution by a series of local perturbations until a minimum is 
achieved. One obvious local search approach for our problem would be to repeatedly change a selected 
nucleotide in the strategy vector and accept the new strategy if it results in an improvement over the 
previous one. Here, we implement a variant (Fig. 3) based on steepest descent. 




Figure 3. A local search algorithm. 



View larger version (17K): 
[in this window] 
[in a new window] 



The steepest descent algorithm considers every possible local perturbation of a single nucleotide in all 
positions of the strategy vector and chooses the move that results in the greatest improvement. This 
algorithm terminates relatively quickly since there are at most A/ possible improvements to be made. 
Each iteration (steps 1-2) requires time 0(A/), so the total running time is 0(A/^). 

A simple classic variant of this algorithm is to accept a new strategy with some probability even if no 
improvement is observed. A Gibbs sampler is a special case of this solution when the probability of 
acceptance is a function of the degree of improvement, and positions for possible perturbations are 
selected at random rather than sequentially as described above. We describe simulation results using 
local search below. 
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. COMPUTATIONAL ANALYSIS OF OPTIMAL 
^ SYNTHESIS STRATEGY 

In this section, we provide a set of computational 
reductions that allow us to obtain a precise 
characterization of the complexity of the optimal 
synthesis strategy. We see this part as the main 
contribution of this paper. 

Multiple sequence alignment formulation 

We first observe that the optimal masking problem can be reduced to a special case of multiple sequence 
alignment, A precise description of multiple sequence alignment can be found in Gusfield (9) and 
Waterman (10). In particular, tiie best L cycle synthesis strategy directly corresponds to the optimal 
multiple alignment of the N oligos, where the costs of the alignment are defined as follows: (i) 
replacement cost = +oo; (ii) deletion cost = +oo; (iii) insertion cost " +1 . That is, the only allowed 
'editing' operation is the insertion of a gap» We first demonstrate this principle with an example. For the 
oligo design problem in Figure 2B, we first align the two oligos CCCAAA and AAACCC. An optimal 
alignment is given by 

CCCAAA 

AAACCC 

Walking across the alignment fi'om left to right creates the following syntiiesis strategy: 
[CCCAAACCC]. Anotiier optimal strategy is [AAACCCAAA]. 

The formal proof of this equivalence is not difficult. Each strategy corresponds to a multiple alignment 
obtained by aligning each sequence against the strategy sequence. Therefore, the shortest strategy 
corresponds to the shortest global alignment. 

This observation enables the application of computational solutions developed in multiple sequence 
analysis such as dynamic programming, Gibbs sampling and iterative refinement (S,10). A common 
greedy solution is based on aligning each pair, then producing a strategy based on the best aligned pair 
and subsequentiy continuing to add oligos to the alignment in the best-first manner. 

Shortest super-sequence formulation 

The above observation is usefiil for obtaining insight into the problem. By reducing our problem to a 
special case of multiple sequence alignment, we show that multiple alignment is 'harder', which does 
not preclude the possibility of an efficient solution to our specific problem. Here we show that the 
optimal synthesis strategy problem is exactly equivalent to the problem of computing the shortest super- 
sequence of a collection of strings. This two-way reduction establishes our problem to be as hard as the 
shortest super-sequence problem, which is known to be NP hard. 
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We informally define sequence to be a super-sequence of sequence Yif every character of K occurs in 
A" in the same order as they occur in Y. Similarly, we define a super-sequence of a collection of 
sequences where the above condition has to hold for each of the sequences. For instance, 
AAAACCCTTTT is a super-sequence of ACT, AAACCCT and AAAATTT. Sequence is the shortest 
super-sequence of a collection of sequences if and only if its length is the shortest among all super- 
sequences of the collection* 

Since the synthesis strategy must be a super-sequence of each of the oligos, the optimal synthesis 
strategy vector is equivalent to the shortest super-sequence of all oligos. The shortest super-sequence 
problem is known to be NP hard (11) and therefore the reduction above formally establishes the optimal 
synthesis strategy problem to be NP hard. More explicitly, the problem of finding a masking strategy of 
length L{L< 4K), given a collection of iV^ oligos of length AT, is NP complete. 

This is an unportant observation since it implies that the optimal synthesis strategy is unlikely to have 
efl&cient (sub-exponential) optimal solutions for a large number of oligos. It is easy, of course, to devise 
relatively efficient dynamic progranuning solutions when the number of oligos is constant (e.g. less than 
10). 



► SIMULATIONS WITH RANDOM OLIGOS 



We have conducted a large number of simulations to 

estimate the performance of several heuristic 

approaches to devising an optimal synthesis strategy. 

Here we report our results with three heuristic 

approaches, (i) Oblivious strategy: we shnply repeat 
synthesizing ACGTACGT... independent of the input 

sequences* (ii) Max simi height heuristics: we choose 

the nucleotide that maximizes the sum height in the next cycle (as outlined above), (iii) Randomized 

local search improvement: once a solution is obtained by the above two methods, we attempt to improve 

the solution using local search. 
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Our results comparing the oblivious and max sum height heuristics are summarized in Table 1. It is clear 
that for random oHgos there is no significant difference in performance between max height and 
oblivious heuristics. It is not particularly surprising for random oligos since, roughly speaking, every 
layer in the chip contains an approximately equal number of nucleotides of each type (A, C, G and T). 
Moreover, our results for *real' oligos appear to be consistent with this performance (data not shown). 
Note that one of the criteria for selecting oligos aims to prevent cross-hybridization between mRNA 
and multiple oligos* This puts 'selective pressure' on the design to ensure that oligos are as different as 
possible. As the number of oligos on the chips grows, they behave more and more like random oligos. 



View this table: Table 1. Comparison of the oblivious strategy with the max sum height 
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[in this window] heuristic 
[in a new window] 



We have interpolated the expected length of a strategy for 10 000 oligos and it appears to fit the 
following function well: 

f{K) = 2,5K = 4.04 

where K is the height of the oligos. 

Figure 4 shows an essentially linear fit of the data. The graph was produced by fitting the functionX^ = 
2.5K + C V A , where C is the single adjustable parameter. As a result, the fit is linear. The formal 
derivation that proves this expectation for max sum height is implied by the analysis in Jiang and Li 

01). 



Figure 4. A linear fit of the fimctionXA) = 2.5K + 4.04V^ to 
the simulation data produced by the oblivious strategy for 
1000 oligos. The scaling constant (4.04) depends on the total 
number of oligos. 




View larger version (lOK): 
[in this window] 
[in a new window] 



Now we provide a brief motivation for the above interpolating function. When we fmd the shortest 
alignment of a single random oligo JfjJfj— -^k ACGTACGT..., aligns with the first, second, third 
or fourth base, X2 aligns with the first, second, third or fourth base after X^^X^ aligns with the first, 
second, third or fourth base after etc. Thus the expected distance between X^ and ^Jfj ^ j is (1 + 2 + 3 + 
4)/4 = 2.5. For example, if Jfj = C, then the distance between JT. andX. + 1 is 1 if A". ^. j = T, 2 if ^ , = 
G, 3 if Jfj ^ J = A or 4 if Xj ^. J = C. The variance of the above possible distances is 1 .25. Therefore, we 

expect jXj^-^K f ®Q^ire an alignment of length 2.5A^ and, by the law of large numbers, a random 

I— 

oligo J!f|X2..Jfj^ requires an alignment of length at most 2.5A'H- 0( V A ) with high probability. In order 

to align N oligos, length 2.5K^ 0(logA^ ) suffices with high probability. For a fixed number of 
oligos, the logarithm term is constant. Therefore, we can use the formula 2.5K + 0( y/K). 
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We also used a simple local search to improve the solutions produced by the max sum height and 
oblivious strategies. The results are given in Table 2. We found that when the number of oligos is small 
(e.g. 100), the improvement was better (~3-5%). However, as the number of oligos grows (e.g. 1000), 
the percentage of improvement was reduced to '^1-1.5%. This is interesting, since Gibbs sampling is a 
close relative of local search and is a popular algorithm for multiple sequence alignment. However, in 
typical multiple alignments of proteins, we often align tens to hundreds of sequences. In this paper, we 
need to 'align' thousands to hundreds of thousands of oligos and it appears to have an impact on tiie 
degree of improvement obtained with this approach. 



View this table: Table 2. Results of local search improvement (LS) for both max sum height 
[in this window] (MSH) and oblivious solutions (ACGT) 
[in a new window] 
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In this paper, we have presented a computational 

formalization of the optimal synthesis strategy for 

oligonucleotide microarrays. We have shown that the 

problem is computationally intractable (NP complete). 

We have provided several simulation results that shed 
light on its practical complexity. As the number of 

applications of oligo microarrays increases and their 

use in diagnostic medical applications becomes a common practice, we expect the design of DNA chips 

to become more sophisticated and efficient. Our main conclusion from both the theoretical and 

simulation analyses provided in this paper is that the problem of optimal masking appears to be 

computationally difficult. Moreover, the simplest possible solution appears to work almost as well as 

more sophisticated approaches that include heuristic greedy approaches and local search. It would be 

interesting to see if more exhaustive approaches based on best-first search, branch-and-boxmd or Gibbs 

sampling methods will generate a more dramatic improvement in performance. Naturally, these results 

must be confirmed in the context of practically used DNA chips. 
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We have investigated the relative merits of two commonly used 
methods for target site selection for ribozymes: secondary structure 
prediction (MFold program) and in vitro accessibility assays. A total 
of eight methylated ribozymes with DNA arms were synthesized and 
analyzed in a transient co-transfection assay in HeLa cells. Residual 
expression levels ranging fh)m 23 to 72% were obtained with anti-PSKHl ribozymes compared to cells 
transfected with an irrelevant control ribozyme. Ribozyme eflScacy depended on both ribozyme 
concentration and the steady state expression levels of the target mRNA. Allylated ribozymes against a 
subset of the target sites generally displayed poorer efficacy than their methylated counterparts. This 
effect appeared to be influenced by in vivo accessibility of the target site. Ribozymes designed on the 
basis of either selection method displayed a wide range of efficacies with no significant differences in 
the average activities of the two groups of ribozymes. While in vitro accessibility assays had limited 
predictive power, there was a significant correlation between certain features of the predicted secondary 
structure of the target sequence and the efficacy of the corresponding ribozyme. Specifically, ribozyme 
efficacy appeared to be positively correlated with the presence of short stem regions.and helices of low 
stability within their target sequences. There were no correlations with predicted free energy or loop 
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length. 

t> INTRODUCTION 

Hammerhead ribozymes are potentially powerful tools for sequence- 
specMc inhibition of target gene expression (1). Their intrinsic 
cleavage activity makes them theoretically superior to traditional 
antisense oligodeoxynucleotides (ODNs) in terms of inhibitory 
capacity. Recent advances (i-3) have e?ctended the range of targets 
so far that virtually any limited stretch of RNA is now likely to contain a useful target However, other 
problems, including methods of delivery and target site selection, remain to be solved. The latter, in 
particular, appears to be a critical step in the design of antisense or ribo2yme molecules for suppression 
of target gene expression, as there may exist only a few sites within any mRNA that are accessible to 
hybridization (4,5). We have compared two commonly employed methods for the rational selection of 
target sites, secondary structure prediction and in vitro accessibility assays. As target we have used the 
mRNA for a novel human protein kinase, PSKHl (6)» 

RNA secondary structure prediction algorithms based mainly on energy minimization have intrinsic 
limitations, although subject to continuous improvement (7,8). An established theory for selection of 
target sites based on predicted structure is also required* The possible xnodulation of RNA structures by 
protein binding in vivo ( 9 AO) cannot yet be modeled and has so far limited their use in designing 
antisense ODNs or ribozymes. A positive correlation between the inhibitory effect of an antisense RNA 
and low local folding potential has been noted (high AG) (11). Together with more recent data (12-M), 
this suggests that extended single-stranded or unstructured regions may be the best targets for antisense 
ODNs. However, although ribozymes targeting predicted loop regions have proved efficacious in some 
cases (15,16), there are also examples of failures (17). A recent systematic analysis of the hybridization 
of tRNA-Phe to a set of complementary ODNs determined that all high-yield heteroduplexes involved 
RNA sequences forming both double-stranded stems and single-stranded regions, and that bases of the 
latter regions were often stacked onto the stems (18). This suggests the requirement for at least some 
degree of helical conformation in the secondary structures of favorable targets. 

The accessibility of different stretches of the mRNA for hybridization with short antisense ODNs may 
be determined in vitro by RNase H-mediated cleavage of the RNA at regions where the ODNs hybridize 
to the target transcript (14.19-22). Screenmg of a large set of ODNs targeting potential ribozyme 
cleavage sites has been employed to select the most promising sites for ribozyme targeting (19). 
Accessibility assays with specMe ODNs have also been performed in cell extracts on endogenous 
transcripts in order to better approximate the in vivo situation (14,22). The most accessible sites within 
any target RNA may be selected by performing the RNase H-assay with a randomized set of ODNs 
(20,21) or with a target-specific set of cDNA fragments prepared by partial DNase I-digestion (23). 
ODN libraries in previous studies (20,21) have been more or less completely randomized. We wanted to 
identify specaiFic sites amenable to cleavage with standard hammerhead ribozymes, and to restrict the 
analysis to those triplets that are cleaved most efficiently. Since the hammerhead ribozyme seems to 
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prefer a purine in the first position of the cleavage triplet and a C or an A in the third position (24), we 
decided to concentrate on the triplets GUC, GUA, AUG and AUA in our analysis. Four ODN libraries, 
each specific for one of these cleavage triplets, were synthesized and used to screen in vitro transcribed 
PSKHl UNA for accessible sites. 

Based on a study of the most active ribozymes expressed in vivo from a randomized library (25) , the 
ribozymes used in the present study were designed to have symmetric 8+8 nt arms (Fig. i) (26). 
Chemically synthesized ribozymes of similar arm lengths have subsequently been successfully 
employed (19,27-29), The stem II structure (Fig. V) was chosen to be 2bp long (30). The activity of such 
truncated ribozymes in cell culture has been demonstrated (27^29). For increased nuclease stability of 
the ribozymes, we have retained unmodified ribonucleotides in only five catalytically important 
positions (3132). Deoxyribonucleotides were used in the flanking arms (33.34) and 2'-0-alkylated 
ribonucleotides in the core and stem-loop II (31,32,25) (Fig. 1). Nuclease stability of ribozymes was 
increased further by an inverted thymidine at the 3'-end (28,29,32) and by a hexanol moiety at the 
extreme 5 -end (36). Ribozymes used in vivo frequently include short stretches of phosphorothioate 
linkages at the 5'-end (28.29)^ 3 -end (M) or both (27) for stabilization agamst exonucleases. We 
included two phosphorothioates at the 5'-end and one at the 3^-end. DNA nucleotides in the arms have 
been reported to increase catalytic efficiencies of ribozymes in vitro, most likely due to increased 
product dissociation rates (37-3^. DNA-armed ribozymes may recruit RNase H activity upon 
hybridization with the target RNA and thus enhance their apparent activity (40). We are here primarily 
interested in the accessibility of cleavage sites. Anything that reduces the importance of intrinsic 
ribozyme cleavage activity and increases the importance of target-specific inhibition of expression is 
desirable. 



3f 9^ 

•at 



Cut 1^ O 
e» »« a» 



View larger versioE (14K): 
[in this window] 
[in a new window] 



Figaar© 1. Schematic of the hammerhead ribozyme 
hybridizing to its target mRNA. The arrow indicates the 
position of cleavage in the mRNA. Numbering is according to 
the nomenclature of Hertel et al (26). Unmodified 
ribonucleotides are in bold lower case, 2 -0-alkylated 
ribonucleotides in plain lower case, and deoxynucleotides in 
upper case. Phosphorothioates are indicated by asterisk, while 
iT denotes an inverted 3'-3' thymidine. Bases of the GUM 
target triplet, where M is C or A, are boxed. 



Cellular delivery of ribozymes is commonly accomplished with various cationic liposome formulations 
(16.19.27-29). We have used the cationic liposome reagent lipofectamine to co-transfect HeLa cells 
with a mbcture of ribozyme, a firefly luciferase reporter gene construct containing the complete coding 
cDNA of the target PSKHl , and a Renilla luciferase-encoding plasmid serving as an internal 
transfection control. We have succeeded in constructing a ribozyme against PSKHl mRNA which 
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reduced the activity of the corresponding reporter gene to 20-25% in a concentration dependent manner. 
A correlation is observed with certain features of the predicted secondary structure of the target mRNA. 

> MATERIALS AN© METHOBS 

All restriction enzymes were from New England Biolabs. 

SyntSiesiis arad purii^catioini of lhatmiiiierllnead rEbo^gnes 
Automated RNA and DNA synthesis was carried out on an Applied 
Biosystems model 394 DNA/RNA synthesizer. 2'-0-Alkyl 
ribozymes containing five unmodified purine ribonucleotides were 
synthesized by solid phase B-cyanoethyl phosphoramidite chemistry (41), using the T-O-tert- 
butyldimethylsilyl protection strategy for the ribonucleotides (42.43). Syntheses were performed on 
controlled pore glass bearing an inverted thymidine linkage (Glen Research). A lipophilic capture tag, 1- 
[Diisopropyl(DL-<^tocopheryloxy)silyl]oxy-6-(2-cyanoethylA^,iV-diisopropylamino-phosphinoxy) 
hexane, was added at the S'-end of the oligomer as described QS)' Cleavage from the support and release 
of all base labile protecting groups (44.451 reverse phase HPLC purification (Pharmacia Source SRPC 
10/10 column, using a flow-rate of 1 ml/min, or a ^Bondai»c C-18 column), lyophilization, desilylation 
(46)t butanol-precipitation and counter-ion exchange with NaC104 were performed essentially as 

described (2^. Ribozymes, retaining a 5' hexanol-linker, were desalted (NAP-10 columns, Pharmacia) 
and quantified by UV spectroscopy. Molar extinction coefficients were calculated based on the nearest- 
neighbor method (Biopolymer Calculator at http://paris.chem.yale.edu ). Exact molecular weights were 
calculated. Ribozymes were controlled by denaturing 15% polyacrylamide gel electrophoresis prior to 
their application in cell culture experiments. 

Sequeince and modntetHioiin oiflkammerlliieadl riiboi^naies 

PSKHl-specifiic 2*-C>-methylated ribozymes (MRz) were designed targeting a total of eight sites. 
AUylated ribozymes (ARz) targeting a subset of these sites were also synthesized in order to evaluate the 
relative effects of the two types of modification on the efficacy of ribozymes. For both types of 
ribozymes, controls having the same chemical composition and length of hybridizing arms were 
designed targeting an irrelevant mRNA (human tissue factor). Ribozymes are named after their type of 
modification and cleavage position. Thus MRz-519 is a methylated ribozyme cleaving after guc519, 
while ARz-519 is the corresponding allylated ribozyme. The respective control ribozymes are MRz-TF 
and ARz-TF. The unique sequences (flanking arms) of the ribo^rmes ware as follows (the conserved 
sequence of the core is indicated by <core> for all but the first ribozyme sequence): MRz-TF/ARz-TF, 
A*A*T-C-T-C-C-T-c-u-g-a-u-g-a-g-g-g-u-u.a-c-c-g-a-a-a-C-T-T-A-G-T*G-iT;MRz-ii«,T*C»G-G-G- 
A-A-G- <Core> -a-C-C-T-T-G-C*T-iT; MRZ'238, C*C*G-G-C-T-T-T <Core> -a-C-A-G-G-G-C*C-iT; 
MRz-287, A*G*T^-G-G-G-G- <core> -a-C-C-G-G-G-G^C-iT; MRz-465/ARz-465, T*G*A-T-G-G-C- 
A- <core> -a-C-G-G-C-T-G*C-iT; MRz-SlWARz-SlKG'^C^h-G-C-l-C-C- <core> -a-C-T-C-A-C- 
A*C-iT; MRz-539/ARz-539, A*C*G-C-A-C-C-C- <core> -ffl-C-G-C-A-G-C*A.iT; MRz-5 48, G'^T'^T-G- 
G-C-A-T- <core> -a-C-G-C-A-C-C*C-iT; MRz-712, A*G*A.T-A-C-C-G- <core> -a-C-G-C-C-A-T*C- 
iT. Modified and unmodified ribonucleotides are in plain and bold lower case, respectively, while 
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deoxyribonucieotides are in upper case. An asterisk denotes a phosphorothioate linkage, while iT 
indicates an inverted 3-3' thymidine (Fig, 1). 

Speciiic aidl semi-rainidloiiinikei ©NA ODNs 

Semi-randomized 13mer antisense ODNs speciiKc for each of the target triplets GUC, GUA, AUG and 
AUA were synthesized and HPLC-purified. The ODNs were designed to mimic the hybridizing arms of 
symmetrically armed (6+6 nt) hammerhead ribozymes and had the following sequences (N denotes a 
randomized position): GUC-speciffic library, (N)gGAC(N)4; GUA-specnlc library, (N)gTAC(N)4; AUC- 

specffic library, (N)gGAT(N)4; AUA-speciffic library, (N)gTAT(N)4. Specific 13mer antisense ODNs 

targeting the selected ribozyme cleavage sites were also synthesized. These ODNs are identified by the 
cleavage position of their corresponding ribozymes, and have the following sequences: 0-118, 
GGGAAGG.4CCTTG; 0-238, GGCTTTG^CAGGG; 0-287, TCGGGGG^CCGGG; 0-465, 
ATGGCAri4CGGCT; 0-519, AGCTCCG^CTCAC; 0-539, GCACCCG.4CGCAG; 0-548, 
TGGCATG^CGCAC; 0-712, ATACCGG^CGCCA. 

PBasmMs 

The cDNA of the coding sequences of PSKHl was previously cloned in our lab and inserted into the 
expression vector pcDNA3 (Invitrogen) downstream of a T7 promoter, producing the plasmid pcDNA3- 
PSK* A Renilla luciferase expression plasmid, pEFl-Rluc, constructed by inserting the EFla promoter 
into the multiple cloning site of the pRL-nuU vector (Promega), was used as an internal control hi co- 
transfection experiments. This plasmid was a kind gift from Professor A^-B. Kolst0. 

Preparattiomi of PSKHl-Mcifferase imim connstractts 

The full-length coding sequence of PSKHl cDNA was cloned in-frame with firefly luciferase cDNA 
into the BgUVNcol sites of the pGL3 Enhancer expression vector (Promega) as a BamHI-Ncol PGR 
fragment, producing the plasmid pPSK-Luc. The PSK-Luc fusion was cloned into the tetracycline 
response plasmid pTRE (Clontech) in two steps. A Kpnl-Bamliii fragment of pPSK-Luc was first 
transferred to the EGFP-1 vector (Clontech). Subsequently, an Ecd91-Bam\{l fragment from this clone 
was excised and cloned into the same sites of the pTRE vector, giving pTRE-PSK-Luc. Finally, PSK- 
Luc cDNA was also cloned mto the pcDNA3 expression vector as an EcoSl-Xbal fragment for higher- 
level expression. This expression plasmid was termed pcDNA3-PSK-Luc. 

In vitro immm^tlm ©ff PSKHl MNA 

For in vitro transcription of PSKHl RNA with T7 RNA polymerase, the pcDNA3-PSK plasmid was 
Imearized mtemally with £caNI, producmg a 1 .03 kb transcript contaming 47 nt of 5' vector-derived 
sequence, or downstream with Xbal (producing a 1 .37 kb transcript). Protein and salt were removed 
from restriction reactions using JetQuick columns (Genomed). Purified DNA was eluted in DEPC- 
treated water (DEPC-HjO), precipitated with ethanol, and resuspended in DEPC-H2O. Run-off 

transcription of uniformly ^^P-labeled RNA was performed in 50 jil reactions containing 2 ng template 
DNA, 0.5 mM each of GTP, ATP, CTP and UTP (Boehringer Mannheim), 50 U RNasin (Promega), 1-2 
jil 10 \iC\l\i\ [a-^^V\tklV or rUTP (Amersham) and 50 U T7 RNA polymerase (New England Biolabs) 
in RNA polymerase buffer (40 mM Tris-HCl, 6 mM MgClj, 2 mM spermidine, 10 mM DTT, pH 7.9). 
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Reactions were incubated for 2 h at 'iVC RNA was then desalted by centrifugation through RNase-free 
G50 Sephadex QuickSpin Columns (Boehringer Mannheim) and deproteinated by phenol extraction. 
RNA was then precipitated with isopropanol and resuspended in RNase-free water. Alternatively, RNA 
was purified by LiCl-precipitation (2.5 M fmal concentration) following transcription. Yield and 
concentration of RNA were calculated from the percentage of incorporated radioactivity. 

AnnttiseiDise ODN ffiEdl ENase H-nifliedliated m vUro cleavage ©ff PSKHl RNA 
RNase H-mediated cleavage assays with antisense ODN libraries were performed in 10 ^1 reactions 
containing 40 ODN library, 50 nM PSKHl mRNA and 0.25 U RNase H (Promega) in a buffer 
containing 40 mM Tris-HCl pH 7.5, 10 mM MgCl2 and 10 mM DTT. A stock dilution of RNA in buffer 

was heated to 95°C for 60 s and then preincubated at 3^VC for 15 min before adding enzyme. An 8.0 \x\ 
mixture of RNA, buffer and enzyme was then mbced with 2.0 ^il 200 nM ODN library, yielding the 
indicated final concentrations of components. Reactions were incubated for 30 min at 37°C, and 
quenched on ice with 10 jil of denaturing loading buffer (8 M urea, 20 mM EDTA, 0,05% bromophenol 
blue, 0.05% xylene cyanol). Samples were heated to 95°C for 2 min prior to loading onto a 4% 
denaturing polyacrylamide gel. RNA size markers were prepared by run-off m vitro transcription of 
differently linearized (iVcoI, Xmnl, BsaAX, Nael, Bst\}\, Fokl and Ddel) pcDNA3 templates. Samples 
and standards were analyzed by electrophoresis for 3-4 h at 50 W at room temperature on 40 cm 
sequencmg gels. The gels were then transferred to Whatman 3MM paper, wrapped in plastic foil and 
exposed overnight in a Phosphor Screen (Molecular Dynamics) prior to analysis on a radioanalytical 
scanner (Storm 860, Molecular Dynamics). Fragment sizes were calculated from their measured 
mobilities and those of the size markers. Reactions with specific antisense ODNs (200 nM, 4-fold molar 
excess) were performed essentially as for the ODN libraries, 

Omltored cells 

HeLa cells (from ATCC) were maintained in Dulbecco's Minimal Essential Medium supplemented with 
2 mM glutamme and 10% fetal calf serum (all reagents from Gibco BRL). Upon thawing, the cells were 
grown to near confluency for 2-3 days and passaged at least once before they were used for 
experiments. Cells were routinely passaged every 3-4 days. 

TranBSHcmt co-ttramsffecttioims 

Cells were plated in 12-well (Costar) or 24-well (Sarstedt) plates at 30-40% confluency and transfected 
at an estimated 60-80% confluency the following day. Transfections were performed with 100 nM 
ribozyme (1.23-1.30 ^g/ml), 0.40 ng/ml reporter construct (pTRE-PSK-Luc or pcDNA3-PSK-Luc) and 
8 ng/ml intemal control plasmid (pEFl-Rluc). Nucleic acids were complexed with lipofectamine (Gibco 
BRL) (a final concentration of 8.2-8.5 ^ig/ml lipid) at a constant 1 :5 w/w ratio, foUowmg optimization 
with the transfection agent. Complexes were formed by mixing equal volumes of medium-diluted 
nucleic acids (ribozyme + DNA) and lipofectamine and incubating at room temperature for 30 min. The 
complexes were subsequently diluted to the final volume with serum-firee medium and added to pre- 
washed cells (250 and 500 ^il for 16 and 22 mm wells, respectively) for 5 h. The complexes were 
subsequently replaced with fiill medium. 



Uptake off FITC-kbekdl nSDozymnies 
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Uptake of FITC-labeled ribozymes under the standard transfection conditions was determined by 
fluorescence spectroscopy. Lipofectamine complexes were diluted with medium and chilled on ice prior 
to addition to cells (in 12-well plates) preincubated (for at least 30 min) either at 37X or on ice. 
Following transfection for 5 h, cells were harvested by washing three times with ice cold phosphate- 
buffered saUne (PBS) and lysed with 1.5 ml PBS-TDS (1% Triton, 0.5% deoxycholate and 1% SDS in 
PBS) for at least 1 5 min at room temperature under vigorous shaking to achieve complete lysis. Parallels 
were combined and fluorescence recorded on a spectrometer (LS-5, Perkin Elmer) at excitation and 
emission wavelengths of 492 and 522 nm, respectively. 

Luciferase activBity assays 

Cells were harvested 24 h after initiation of transfection and washed twice in cold PBS prior to lysis for 
30 min on ice with a passive lysis buffer supplied with the Dual-Luciferase® Reporter Assay System kit 
(Promega). Following brief centrifugation to pellet cell debris, luciferase assays were performed in white 
non-transparent 96-well plates (Nunc) using a plate-reading luminometer (MicroLumat Plus, EG&G 
Berthold) equipped with two injectors. Dual-luciferase assays were performed on 25 |il lysate 
supernatant. Firefly and Renilla luciferase activities were recorded following the respective injections of 
100 ^il LAR n and 100 ^1 Stop&Glow reagents of the assay system according to the manufacturer's 
instructions. The instrumental background levels of luminescence were recorded for empty wells. 



D> RESULTS 



^ TOP 

^ ABSTRACT 
^ INTRQ]D)UCT11QN 
^ MATERIALS AND METHODS 
° RESULTS 
^ DISCUSSION 
^ REFERENCES 



SeleettiioiQi of nbo^mme tergei sntos Iby MFoM secomidaiiry strMctar® 
predktioE 

We have^used the MFold program (version 3,0 at 
http://mfold2.wustl.edu) (7,8) to predict the secondary structure of 
PSKHl mRNA as a means of selecting GUC sites for riboi^me 
targeting. The MFold program generated many suboptimal secondary 
structures that differed minimally in energy from the optimal folding. Although these secondary 
structures displayed varying degrees of folding differences, certam substructures could be identified that 
demonstrated a considerable degree of conservation. High conservation of these substructures among 
alternate suboptimal structures may increase their likelihood of being 'true' structures (i.e. contamed in 
the actual secondary structure of the mRNA in vivo). Therefore, in selecting GUC sites for hammerhead 
ribozyme targeting, we have screened 10 optimal and suboptimal secondary structures of the &st 1.0 kb 
of the PSKHl transcript, as transcribed from pTRE-PSK-Luc, for recurring substructures. All selected 
target sites were located within identical substructures in at least six out of the 10 most energetically 
stable structures, including the optimal structure (Table 1). For some target sites, several of the 
suboptimal structures diverged only slightly from the consensus in ways that did not significantly 
change the basic characteristics of the substructure. Due to the lack of a consensus on what kind of 
structures make the best targets, we have chosen GUC targets (guc238, guc287, guc519 and guc712) that 
are predicted to be located within very different substructures (Fig. 2). The guc238 cleavage triplet is 
located within a large internally looped stem in which the 5' arm (stem I) of the corresponding ribozyme 
is complementary to the nucleotides of the loop. The guc287 target sequence, on the other hand, is 
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predicted to fold into a short stem-loop structure. The gucS19 target site is located at the end of what is 
essentially a very long basepaired region that is interrupted by occasional short bulges. A ribozyme was 
also designed targeting guc712, located at a particularly stable hairpin structure at the end of a very long, 
and presumably very stable, stem. This target site was mcluded due to the high conservation of its 
secondary structure among suboptimal structures of the target RNA (identical in 10 out of 10 and 19 out 
of the 20 energetically most favorable structures). In addition to the above target sites, the first GUC site 
in the RNA (gucl 1 8) was selected due to its close proximity to the translation initiation region (only -20 
nt downstream of the initiation codon). This region has often been targeted (1233.47A8) because the 
RNA in this region might be relatively open due to the need for binding of the components of the 
translation machinery. The predicted secondary structure for the gucl 1 8 target consists of a stem with 
two internal loops (Fig. 2). 

View this table: Table 1. Features of the predicted secondary structure of selected ribozyme 
[in this window] target sites 
[inan^WWindow] 
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In vitro accessibility assays with semi-randomized ODN libraries and specific ODNs 

Target triplet-specific, semi-randomized ODN libraries were used to screen in vitro transcribed PSKHl 

RNA for hybridization-accessible GUC, GUA, AUG and AUA sites. Accessible sites were identified by 

RNase H-mediated cleavage of the RNA at the RNA-DNA hybrids generated by hybridization of 

antisense ODNs to target RNA (19-21). Screening of a 1 .03 kb PSKHl transcript fi-om the pcDNA3- 

PSK plasmid was performed separately with each of the four ODN libraries (Fig, 3 A). Since the 

transcript was uniformly labeled, cleavage at any position produced two cleavage products. This 

complicated the interpretation of the data since each pair of cleavage fragments suggested two possible 

sites of cleavage. Screening reactions were subsequently performed on a longer RNA transcript 

(1 .37 kb) that differed fi^om the first transcript in its 3*-end. This facilitated the interpretation of cleavage 




Figure 2. Excerpts of the energetically optimal secondary structure 
of the first 1000 bases of the pTRE-PSK-Luc transcript, as 
predicted by MFoId version 3.0 (7,8). Target sequences of the 
ribo^mes are highlighted. Arrows indicate position of cleavage. 
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data, as 5' cleavage fragments were identical for the two RNAs* Screening of two differently sized 
transcripts increased confidence in the results. Any accessible sites that are not identified in Hie longer 
transcript are likely to be due to structural features of the specific transcript that may not be represented 
in the full-length mRNA- Such sites are more likely to be encountered at the ends of the transcripts since 
the local folding of these regions will be most strongly affected by the lack or presence of additional 
sequences. Consequently, any cleavage sites originating within the first or last 200 nt of the transcript 
were ignored in the selection of accessible sites by library screening, as were cleavage sites that could 
not be detected in the longer transcript. 




Figimre 3* In vitro accessibility assays- (A) Separate screening of a K03 
kb PSKHl in vitro transcript with triplet-speciifDC ODN libraries. 
Reactions were performed with 50 nM RNA and 40 |iM ODN library 
as detailed in Materials and Methods. Arrows indicate pairs of 
fragments resulting from cleavage at subsequently selected ribo^me 
target sites. (B) Cleavage of PSKHl RNA (50 nM) with molar excess 
(200 nM) of specific ODNs against ribo2yme target sites selected by 
oligo library screening (0-465, 0-519, 0-539, 0-548) and structural 
considerations. 
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The screening assays demonstrated several reproducible cleavage sites that obeyed the above 
restrictions. Most cleavage sites were identified with the GUA and GUC libraries, while no useful sites 
were identified with the AUA library (Fig. 3A). Screening of PSKHl RNA with the GUA library 
produced two very prominent cleavage fragments that were not produced by any of the other libraries, 
demonstrating their GUA triplet specificity. The sizes of these two fragments were estimated at --615 
and 405 nt Their combined estimated size of 1020 nt was in good agreement with the transcript length 
(which was 1030 nt). The shorter fragment was shown to be the 5' cleavage fragment, as it was also 
present when screening the longer RNA transcript (data not shown). The size of this fragment was 
consistent with cleavage near a site corresponding to position 460 in the PSKHl cDNA. Examination of 
the sequence indicated a suitable GUA triplet at position 463-465 as the only probable site of cleavage. 
Screening with the GUC library produced six closely spaced fragments with estimated sizes ranging 
from 470 to 560 nt which fitted nicely into three pairs with combined sizes of the expected length 
(1020-1030 nt). By a similar analysis as for the GUA library cleavage fragments, the putative cleavage 
sites were identified as guc5 1 9, guc539 and guc548, respectively. One of these sites, guc5 19, had 
previously also been selected on the basis of MFold secondary structure predictions (see above) prior to 
the antisense library screening. The AUG library screening also produced several fragment pairs, but 
these were either weaker than their GUA and GUC counterparts, or could not be unambiguously 
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assigned to a specific triplet. The screening assays were performed with a ratio of ODN (40 [iM) to 
target RNA (50 nM) of 800:1, which given the complexity (4*° = 1 x 10^) of the libraries corresponds to 
a ratio of speciiffic ODN (40 pmol) to RNA of roughly 1 : 1 250. This might seem too low a ratio to 
reasonably account for the degree of cleavage observed. However, the observed cleavage would 
correspond to a tumover of only -100 molecules/h. Furthermore, the concentration of ODNs that can 
productively hybridize to a given target site may be significantly higher as mismatches corresponding to 
the ends of the ODNs are not expected to severely impair hybridization efficiency. Non-triplet-specillic 
cleavage events (uivolving ODNs with mismatched or wobble-paired triplets, as well as partial 
hybridization of only the random-armed portions) may also conceivably occur, although they are less 
likely to account for the major cleavage events. Non-speciffic cleavage events would also be expected to 
result from screening with more than just one library. Examples of such cleavage events were observed, 
but did not include the selected cleavage sites. Nonetheless, to ensure that putative ribozyme target sites 
were properly identified, the provisional identifications were subsequently confirmed by cleavage of the 
RNA with the corresponding spedific antisense ODNs. In all cases, the cleavage fragments produced by 
the specniFic ODNs co-migrated with the corresponding cleavage fragments produced by the library 
screenings (data not shown). These results demonstrate the utility of triplet-specMc semi-randomized 
ODN libraries in identifying potential target sites. 

In vitro cleavage assays were performed with specific ODNs against all sites targeted by ribozymes to 
investigate whether the sites that were selected by library screening were more accessible than target 
sites selected by alternative means. The specific ODNs, like those of the semi-randomized libraries, 
were designed to have the same recognition sequences as a corresponding hypothetical 6+6 armed 
hammerhead ribozyme. The in vitro cleavage assays demonstrated that all the ODNs targeting sites that 
were selected by library screening resulted in stronger cleavage than the best among the ODNs targeting 
sites selected by theoretical/structural considerations (Fig. IB). Furthermore, the strongest cleavage was 
achieved with the ODN (0-465) corresponding to the GUA site that was cleaved most strongly in the 
library screenings (Fig. 3A). Thus accessibility data obtained from library screening assays were 
representative for the hybridization efficiencies of the corresponding unique ODNs. Consequently, 
major cleavage events do not appear to be substantially influenced by the presence of non-spec51fic 
ODNs in the library screening. This suggests that neither positive nor negative cooperativity of bmding 
(49) occurs to a significant degree under our assay conditions. 

Cdtalar uptake of ribozyme 

FITC-labeled control ribozymes were used to investigate the cellular uptake of ribozyme under the 
optimized co-transfection conditions by spectroscopy. In order to distinguish between mtracellular 
uptake and membrane association by spectroscopic analysis, uptake experunents were performed in 
parallel by incubating cells on ice as well as at 3VC. Similar amounts of MRz-FITC and ARz-FITC 
ribozymes were found to be associated with cells following incubation both on ice (26-28% of total 
added ribozyme) and at 37°C (58-59% of added ribozyme) (data not shown). The increase in cell- 
associated fluorescence at the higher temperature, suggests that m excess of 30% of ribozyme has been 
internalized. The cellular association of ribozyme following incubation on ice was shown to have a 
linear dependency on ribozyme concentration in the range 0-100 nM (data not shown). This is consistent 



Secondary structure prediction and in vitro accessibility of mRNA as tools in the selecti-i Page 11 of 23 

with a state of equilibrium between membrane-bound and free (in medium) ribo2yme. Assuming that 
this equilibrium is maintained upon internalization of ribozyme at the permissive temperature, the 
amount of internalized ribozyme is determined by the foraiula /= 3n[B-a)/(r--«t), where T is the total 
amoxmt of ribozyme added to cells, and a and B are the measured cell-associations upon incubation on 
ice and at 37°C, respectively. Net uptake of ribozyme under the given transfection conditions was 
estimated at 42-45% (data not shown). Co-transfection optimization experiments performed at various 
w/w ratios ( 1 : 1 to 6: 1 ) of lipofectamine to total nucleic acids demonstrated that uptake was only slightly 
influenced by lipofectamine concentration above a certain threshold (2:1 w/w ratio) that constitutes a 
molar excess of positive charges (from cationic liposomes) in the complexation mixture. Below this 
threshold (at a 1 : 1 w/w ratio, yielding complexes of net negative charge), uptake was severely impaired 
(data not shown). 

EiFfficacy off ribozymes icD c©4raini§ffecttioM experiinnieiflitts iini HeLa c<5llls 

The in vivo efficiency of the selected ribozymes was evaluated in a cell culture assay in which 100 nM 
of ribozyme was co-transfected with a plasmid coding for a PSKHl-luciferase fiision protein. In order to 
correlate for transfection variability and improve experimental reproducibility, a plasmid coding for 
Renilla luciferase (Rluc) under the control of an EF la-promoter was added to the transfection mixture as 
an internal control. This control proved to be essential as the general expression levels oi Renilla 
luciferase varied up to 2-3-fold within the same experimient for different ribozymes (data not shown). 
Experiments Were performed as far as possible with the full complement of ribozymes of identical 
chemistry so that in each experiment, the efficacies of ribo^mes were compared under identical 
conditions. In each experiment, the relative firefly luciferase activity for all ribo:ganes was normalized 
to the levels for the control ribozyme (the expression of which was set at 100%). Normalization was 
always performed relative to the relevant control ribozyme. The data from experiments with both 
methylated and allylated ribozymes are summarized in Figure 4. The most effective methylated 
ribozyme in co-transfection assays proved to be MRz-287, targeting guc287. This ribozyme reduced the 
level (normalized) of luciferase expressed from the reporter gene to -23% of the control levels, i.e. the 
levels in cells treated with irrelevant control ribozyme. The second most efficient ribozyme was MRz- 
519, which resulted in -65% inhibition of expression. The ribo:^mes targeting sites gucl 18 and guc548, 
exhibited similar inhibitory effects, with residual expression levels just over 40%. At the other end of the 
spectrum we fmd the ribozymes targeting guc712 and guc539, which result in only -30% mhibition. 

Figanre 4. Lipofectamine-mediated co-transfection of HeLa 
cells with ribozyme, pTRE-PSK-Luc, and pRLuc as detailed 
in Materials and Methods. Eight methylated (MRz-) and three 
allylated (ARz-) PSKHl-speciffnc ribozymes were analyzed 
together with their respective controls (MRz-TF and ARz- 
TF). PSKHl -dependent firefly luciferase expression was 
normalized to Renilla luciferase expression for each sample* 
Normalized expression in cells transfected with control 
ribozymes within each series was set at 100%. Data are 
averages of 4-7 independent e3q)eriments. Expression levels 
are indicated above the error bars (+SD) of each column. 
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Three of the ribozymes were synthesized also in the allylated version to evaluate any systematic 
differences in inhibitory capacity of methylated and allylated ribozymes. The effect of the allylated 
ribozymes varied over a relatively wide range, resulting in residual reporter gene expression ranging 
from -^35% for ARz-519 to 95% for ARz-539 (Fig. 4). Comparing the data for these ribozymes with 
those of their methylated counterparts, we observed the same activity ranking. For both types of 
modification a ribozyme targeting the guc5 19 cleavage site was superior to the ribozymes targeting the 
two other sites, gua465 and guc539, of which the latter site appeared to be least amenable to cleavage. 
This constitutes evidence that the observed mhibitory effects of these ribozymes are sequence speeiffic. 
The difference in activity of methylated and allylated versions of otherwise identical ribo^mes varies 
with the target of the ribozymes. While the two types of ribozymes targeting guc5 19 are equally active, 
the allylated ribozymes targeting gua465 and guc539 are less active than their methylated counterparts. 
The combined data suggest that methylation generally results in superior ribozymes and, further, that the 
allyl modification seems to be more detrimental to ribozyme efficacy when targeting poorly accessible 
sites. Reduced efficacy of allylated ribozymes is not due to differences in intracellular uptake, as the 
methylated and the allylated FITC-labeled ribozyme were internalized to the same level by 
lipofectamine-mediated transfection (see uptake experiments). 

Rnbozyme effects depeMd oe rnlbci^^iiMe cosncemittratioiii annd ttsirget geme proMoteir sftreEgth 
The concentration dependence of the ribozyme effect was investigated in co-transfection experiments in 
which the concentration of the active anti-PSKHl ribozyme was varied and the total concentration of 
ribozyme adjusted to 100 nM with control ribozyme. For dose-dependence experiments, the most active 
allylated (ARz-519) and methylated (MRz-287) ribozymes were selected. These experiments confirmed 
the dose-dependence of the ribozyme effect (Fig. 5 A). We next wanted to investigate whether increasing 
the steady-state target gene expression would reduce the specific mhibitory effect of ribozymes. Higher 
expression of the reporter was achieved by placing it under the transcriptional control of the CMV 
promoter m the pcDNA3 expression plasmid. This resulted in 20-fold enhancement of rqwrter 
expression (data not shown) compared to the weaker tetracycline-responsive promoter. In co- 
transfection experiments, this increase in reporter expression was accompanied by reduced apparent 
efficiencies of three of the most active PSKHl ribozymes (Fig. 5B). The average levels of inhibition of 
target gene expression with the weak promoter were 73, 63 and 51% for ribozymes MRz-287, MRz-519 
and MRz-548, respectively. With the strong promoter, yielding the 20-fold higher expression of reporter 
(measured in control ribozyme-treated cells), their inhibitory activities were 60, 43 and 33%, 
respectively. Thus the inhibitory effect of ribozymes, as expected, was dependent on the concentrations 
of both ribozyme and target, but the ribozymes were able to inhibit almost completely the 20-fold 
increase in the target mRNA. 

Fignnr© 5. Dependence of ribozyme inhibitory activity on ribozyme 
concentration and reporter gene expression. (A) Transfection of HeLa 
cells with increasing concentrations of ARz-519 or MRz-287 
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ribozymes. Total concentration of ribozyme is adjusted to 100 nM 
with control ribozyme (ARz-TF or MRz-TF) and transfections 
performed after the standard protocol. Data for representative 
experiments are shown. (B) Parallel co-transfections of HeLa cells 
with selected ribozymes and different reporter constructs (pTRE- 
PSK-Luc and pcDNA3-PSK-Luc). Relative expression levels in 
control cells were 20-fold higher with the pcDNA3-PSK-Luc 
construct compared to pTRE-PSK-Luc. 
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Comparing ribozymes selected by MFold secondary structure prediction and in vitro accessibility 

assays 

Ribozymes were ranked based on their inhibitory activity in co-transfection assays, with tied ranks given 
to ribozymes resulting in residual expression levels differing by <3% (Table 2), This classification can 
be used to assess the relative merits of the two methods of target site selection. The four ribozymes that 
were selected on the basis of library screening data (MRz-465, MRz-519, MRz-539, MRz-548) were 
compared to those selected on the basis of structural considerations (MRz-1 1 8, MRz-238, MRz-287, 
MRz-519, MRz-712) by Wilcoxon's rank sum test on unpaired samples (50). The rank sums for the two 
groups of ribozymes were 18 (average rank = 4.5) and 20 (average rank = 5.0), respectively, which are 
higher than the critical rank sum of 1 1 required for a 5% significance level From these data, ribozyme 
targets selected on the basis of m vitro accessibility are no more susceptible to ribozyme cleavage in vivo 
than those selected by structural prediction or other considerations. 
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Correlation of riboi^me efficacy with the predicted target secondary structure 
The two methods of target selection both resulted in ribo^mes with very diverse inhibitory activities. 
We therefore decided to investigate whether there was any correlation between predicted structure of 
target sites and the in vivo efficacy of corresponding ribozymes. We approached this by attempting to 
break down the secondary structures of the target sequences in readily quantifiable parameters. In doing 
so we have disregarded certain factors that may influence hybridization efficiency but which cannot be 
easily quantified. Such factors include the positioning of single-stranded stretches within the target 
sequence and the ability of a specific hanmierhead sequence to assume its proper catalytic three- 
dimensional structure. The presence of so-called free ends, the positioning of single-stranded stretches at 
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the ends of target sequences, may be expected to be correlated with enhanced antisense binding (12), 
while elem«its of strong secondary structure in the neighborhood of the target sequence proper might 
impede ribozyme folding. We do not imply that these factors are unimportant, merely that as they cannot 
be easily quantified, oflier more readily quantifiable parameters should be investigated for predictive 
value. Three potentially important parameters for target site accessibility were local free enwgy of 
folding of the target sequence (U), the size of single-stranded stretches (loops) that may fimction as 
'hooks' for nucleation of duplex formation (51,52), and the length and stability of stems and helices. 
Stems and helices may need to be opened up for full hybridization of ribozyme to occur and their length 
and stability may therefore influence hybridization efficiency. Consequently, the target sequences of all 
ribozymes were decomposed into length of the major loop, stem and helix, while free energies have been 
calculated both for the target sequence as a whole and for its major helical region (Table 1). Target 
sequence free energies were calculated for the energetically optimal folding of the RNA, by adding up 
the energy contributions from all base-pairings, stacking interactions, bulges and various loops that are 
contained within the target sequences, as indicated by the MFold program. MFold was also used to 
determme the free energy of the most stable ('limiting') helical region and to estunate the strength of the 
ribozyme-target hybrid (by folding the corresponding cw-acting ribo2yme in which catalytic and 
substrate strands were connected through a 5 nt loop at stem I). 

Spearman's rank correlation test (53) was used to mvestigate the level of correlation of ribozyme 
efficacy rankmg with various features of the predicted secondary structure of the target sequences (Table 
2). No correlation was observed with the length of the major loop (Pearson's correlation coefficient r = 
0.16) or ribozyme-substrate duplex free energy (r = 0.18). The correlation with target sequence free 
energy was weak (r = 0.35) and not significant Ribozyme efficacy was, however, significantly 
correlated (P <0.025) with both the length of the major base-paired stretch (r = 0.75) and the energy of 
the most stable helix (r = 0.77) within the target sequence (Table 2). Conelation was improved (r = 
0.84) when considering stem length and helix stability together (for this analysis, the rank was taken as 
the average of the two individual ranks for stem and helbc). Correlation coefficients were not 
significantly affected by ranking ribozyme activity without the use of ties (Table 2). 
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In this study we have analyzed DNA-armed chemically modified 
hammerhead ribozymes targeting eight GUC and GUA sites selected 
by two different methods, in vitro accessibility assays and MFold 
prediction. In a co-transfection controlled assay, the ribozymes 
resulted in residual luciferase reporter gene expression ranging from 
23 (MRz-287) to 72% (MRz-539) (Fig. 4). This activity may in part be due to the presence of DNA arms 
which allow RNase H-mediated cleavage of DNA-RNA hybrids. Target gene expression normalized to 
the expression of a co-transfected moE-4arget gene allowed the control of any nnoini-torget-speciffic 
sequence effects of the ribozymes, Non-speciflic effects are occasionally encountered in association with 
extended stretches of phosphorothioate (P=S) linkages (19). In an attempt to minimize such effects, no 
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more than two consecutive P=S linkages were incorporated. Finally, ribozymes of different chemistry 
(with methylated RNA mstead of DNA in the arms) against two of the targets (guc519 and gucS48) were 
less efficient in inhibitmg target gene expression than their DNA-armed counterparts (unpublished data). 
In combination, the above data make it unlikely that the observed effects are due to aspects of ribozyme 
sequence or chemistry unrelated to hybridization-specific activity. 

The inhibitory effect of ribozymes was dependent on both the ribozyme concentration (Fig. 5A) and the 
steady-state expression levels of the reporter gene (Fig. 5B). A reduced inhibitory effect was observed 
for three selected methylated ribozymes when expressing the reporter gene from a stronger promoter 
resulting in 20-fold higher expression levels (Fig. 5B). However, the molar amount of target suppression 
when the stronger promoter was used increased to a degree ahnost matching the increase in the level of 
the exogenous target. This interesting fact suggests an enzymatic rather than a stoichiometric effect of 
the ribozymes on target gene expression. The enzymatic function may derive entirely from the ribozyme 
or include RNase H activity. It should, however, be noted that the increase in the total level of PSKHl 
mRNA by expressing the exogenous transcript from the stronger promoter depends also on the unknown 
level of the endogenous transcript, which is another target for the ribozyme. The level of suppression 
achieved with the best ribozyme in this study (77%) is similar to that which has previously been reported 
for the best of several unmodified ribozymes in a similar luciferase reporter gene co-transfection assay 
(16). Other comparable studies have reported inhibition levels in the range of 40-80% for a set of 1 5 
ribozymes (54)> 50% inhibition with a pair of variously modified ribozymes (29), and 40-55% inhibition 
for a pair of 2'-F-pyriniidine modified ribozymes with phosphorothioates at both ends (27). Thus our 
results, achieved with generally more extensively modified ribozymes, compare favorably with previous 
reports. 

We have also attempted to evaluate the relative effects of two conunonly employed types of 2'-0- 
modifications on the activity of ribozymes. The same rank order of ribozymes targeting three selected 
sites was observed for both methylated and allylated DNA-armed ribozymes (Fig. 4). Allylated 
ribo2ymes were generally less efficient in inhibiting reporter gene expression in co-transfection assays 
compared to their methylated counterparts. The difference in activity of methylated and allylated 
versions of otherwise identical ribozymes appeared to correlate with the susceptibility of the target to 
inhibition by ribozyme. While the ribozyme targeting the least accessible of the dually target sites 
(guc539) apparently was most sensitive to the type of alkylation, ribozyme species of either modification 
were equally efficient when targeting the most susceptible site (guc519). 

Predicted secondary structures of targets selected on the basis of the MFold program were very diverse 
and included a short stem-loop (guc287), a stem wdth a large internal loop (guc238), a bulged stem 
(guc519), and a hairpin structure (guc712), as well as a site near the translation initiation site that was 
presumed to be relatively unstructured (Fig. 2). In vitro accessibility assays wdth cleavage triplet-specffic 
ODN-libraries identified three accessible GUC sites and one GUA site (Fig. 3 A). In vitro cleavage 
assays with specific ODNs against all sites targeted by ribozymes confirmed that the sites that were 
selected by library screening were indeed more accessible in vitro than sites selected by alternative 
means (Fig. 3B). This confirms the utility of such semi-randomized libraries for identifying the most 
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accessible sites in vitro. However, ribozymes targeting sites selected on the basis of in vitro accessibility 
assays were no more efficient in inhibiting target gene expression in a co-transfection assay than 
ribozymes targeting sites selected by theoretical means. In fact, the ribozyme targeting the most 
accessible site in vitro (MRz-465) ranked only as the fifth most active (Table 2), while the target site of 
the best ribozyme (MRz-287), was relatively inaccessible in vitro (Fig* 3B). The lack of correlation 
between />2 vitro accessibility and in vivo efficacy data is consistent with previous observations of other 
researchers (55-58). A possible explanation for the poor correlation is that the target mRNA is folded 
differently in vitro than in vivo. The secondary structure of the folded RNA may not be the energetically 
most stable (fast local folding events may prevent more energetically favorable interactions between 
distal regions). Structural features that promote hybridization in vivo and in vitro may also differ. 
Furthermore, the hybridization efficiency of 1 3mer antisense ODNs may not be entirely representative 
for the hybridization characteristics of longer (32mer) hammerhead ribozymes with varying degrees of 
secondary structure of their own* Finally, the generation of higher-order mRNA structures and 
modulation or masking of the mRNA secondary structure by RNA-binding proteins (9, 10) or protein 
complexes (ribosomes) (59) in vivo may influence accessibility of target sites, although these factors 
would also tend to rule out the applicability of secondary structure predictions for target site selection. 
Recent data suggest that performing in vitro accessibility assays on an endogenous transcript m a protein 
environment (cell extracts) may improve the accuracy of the predictions (14,22). 

Although in vitro accessibility assays proved to be of Ihnited predictive value for the in vivo situation, 
correlative studies suggested that secondary structure predictions might have some merit Significant 
correlation was found between ribozyme efficacy and the presence of short stems and energetically 
unstable helices withm the ribozyme target sequence (Table 2). Ranking of ribozymes according to these 
two criteria, the relative efficacies of ribozymes were predicted nearly perfectly, the only significant 
discrepancy being the ribozyme targeting gua465. As well as being most accessible in vitro, this site also 
has a secondary structure that according to the above criteria should make it a significantly better target 
site than observed here. Other factors, such as the lack of single-stranded bases near the ends of the 
target sequence or a prohibitive environment for ribozyme folding, may explain the results. 
Notwithstanding this discrepancy, there is an apparently clear correlation between ribozyme efficacy and 
predicted target sequence secondary structure. Furthermore, the combined data from this study suggest 
that the previously reported correlation of ribozyme efficacy with local folding potential (H) may be 
incidental. Our data suggest that high target sequence free energy alone may not be sufficient for 
efficient ribozyme targeting. We propose that the above correlation is a consequence of the need to have 
some unpaired regions to facilitate fast nucleation of duplex formation (51,52) . combined with short 
base-paired regions and helices of low stability that easily open up. Fulfillment of these eoteria will in 
many cases result in a low local folding potential (high free energy) for the target site. 

A recent study on the effect of varying RNA secondary structure on the efficiency of specffic antisense 
ODNs concluded that target sequences located within regions designed to be unstructured were most 
effective, while targets within stable stem-loop structures were ineffective (13). Recent reports by Patzel 
and co-workers (12,14) described a theoretical approach for antisense ODN target site selection based on 
the prediction of large single-stranded stretches (loops) by MFold. Our data do not support a correlation 
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of ribozyme activity with the length of loops, possibly because all our targets had shorter predicted loops 
than recommended in the above studies. Our hypothesis and the conclusions of the above studies are, 
however, not mutually exclusive. Target sites containing very large loops will have a good probability of 
also containing short helical regions, which we propose to be the limiting factor. In fact, applying our 
hypothesis of target site evaluation to three previously well characterized target sites (t351, t398, t498) 
(14,22) within mRNA for murine DNA methyl transferase, the same rank susceptibility of targets, in 
perfect accordance with actual inhibition data, is predicted by both theories* 

In addition to the structure of the target site, the composition of the target sequence may also be of some 
unportance. Sequences with a high G+C content will hybridize more efficiently with the complementary 
arms of their ribo2ymes and possibly increase the efficacy of the ribozyme. Although a general 
correlation between hybrid stability and ribozyme efficacy was not supported by our data (Table 2), it is 
worth noting that the most efficient ribozyme, MRz-287, has a substantially higher affinity for its target 
sequence than the otfier ribozymes (Table 1) due to an unusually high G+C content (13 out of 16 nt)- In 
conclusion, the target sequence of MRz-287 represents the proposed desirable structural features for a 
good ribozyme/antisense target site* The target sequence consists of alternating short stretches of paired 
and unpaired bases, which limit stems and helical regions to no more than 3 bp. All other target 
sequences fold into secondary structures containing longer and more stable helical regions. One study 
has reported that hybridization accessibility for hammerhead ribozymes is correlated with the presence 
of unpaired bases near the cleavage triplet (60). Although our data do not suggest this to be a critical 
requirement for in vivo activity, the above criterium is also fulfiled for the guc287 target site, as the 
longest single-stranded stretch is situated around the cleavage triplet and mcludes the base preceding the 
scissile bond (Fig. 2). All of the above mentioned features of guc287 add up to a very effective ribozyme 
target site, in good agreement with its observed inhibitory cjq)acity in HeLa cells. 

In conclusion, our study indicates that there is a poor correlation between the apparent in vivo 
accessibility of a target and its accessibility in a completely cell-free in vitro assay as performed here. 
Thus such assays appear to be of limited value even for a preliminary selection of target sites. However, 
predictions by the MFold program suggest a correlation of certain features of the predicted secondary 
structures of target sequences, helical stability in particular, with ribozyme efficacy. The generality of 
these findings will, however^ need to be investigated in an alternate test system. If these correlations 
should be confirmed, this would represent a significant improvement in the preliminary selection of 
candidate ribozymes. Ultimately, however, an empirical cell-based assay will still need to be performed 
to select the best of these candidates. 
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ABSTRACT 

We have investigated the relative merits of two 
commonly used methods for target site selection for 
ribozymes: secondary structure prediction (MFold 
program) and /n vitro accessibility assays. A total of 
eight methylated ribozymes with DNA arms were 
synthesized and analyzed in a transient co-transfection 
assay in HeLa cells. Residual expression levels 
ranging from 23 to 72% were obtained with anti-PSKHI 
ribozymes compared to cells transfected with an 
irrelevant control ribozyme. Ribozyme efficacy 
depended on both ribozyme concentration and the 
steady state expression levels of the target mRNA. 
Aliylated ribozymes against a subset of the target 
sites generally displayed poorer efficacy than their 
methylated counterparts. This effect appeared to be 
influenced by /n vivo accessibility of the target site. 
Ribozymes designed on the basis of either selection 
method displayed a wide range of efficacies with no 
significant differences in the average activities of the 
two groups of ribozymes. While in vitro accessibility 
assays had limited predictive power, there was a 
significant correlation between certain features of 
the predicted secondary structure of the target 
sequence and the efficacy of the corresponding 
ribozyme. Specifically, ribozyme efficacy appeared 
to be positively correlated with the presence of short 
stem regions and helices of low stability within their 
target sequences. There were no correlations with 
predicted free energy or loop length. 

INTRODUCTION 

Hammerhead ribozymes are potentially powerful tools for 
sequence-specific inhibition of target gene expression (1). 
Their intrinsic cleavage activity makes them theoretically 
superior to traditional antisense oligodeoxynucleotides 



(ODNs) in terms of inhibitory capacity. Recent advances (1-3) 
have extended the range of targets so far that virtually any 
limited stretch of RNA is now likely to contain a useful target. 
However, other problems, including methods of delivery and 
target site selection, remain to be solved. The latter, in 
particular, appears to be a critical step in the design of anti- 
sense or ribozyme molecules for suppression of target gene 
expression, as there may exist only a few sites within any 
mRNA that are accessible to hybridization (4,5). We have 
compared two commonly employed methods for the rational 
selection of target sites, secondary structure prediction and 
in vitro accessibility assays. As target we have used the mRNA 
for a novel human protein kinase, PSKHl (6). 

RNA secondary structure prediction algorithms based 
mainly on energy minimization have intrinsic limitations, 
although subject to continuous improvement (7,8). An established 
theory for selection of target sites based on predicted structure 
is also required. The possible modulation of RNA structures by 
protein binding in vivo (9, 10) cannot yet be modeled and has so 
far limited their use in designing antisense ODNs or 
ribozymes. A positive coirelation between the inhibitory effect 
of an antisense RNA and low local folding potential has been 
noted (high AG) (11). Together with more recent data (12-14), 
this suggests that extended single-stranded or unstructured 
regions may be the best targets for antisense ODNs. However, 
although ribozymes targeting predicted loop regions have 
proved efficacious in some cases (15,16), there are also examples 
of failures (17). A recent systematic analysis of the hybridization 
of tRNA-Phe to a set of complementary ODNs determined that 
all high-yield heteroduplexes involved RNA sequences 
forming both double-stranded stems and single-stranded 
regions, and that bases of the latter regions were often stacked 
onto the stems (18). This suggests the requirement for at least 
some degree of helical conformation in the secondary structures 
of favorable targets. 

The accessibility of different stretches of the mRNA for 
hybridization with short antisense ODNs may be determined 
in vitro by RNase H-mediated cleavage of the RNA at regions 
where the ODNs hybridize to the target transcript (14,19-22). 
Screening of a large set of ODNs targeting potential ribozyme 
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Figure 1. Schematic of the hammerhead ribozyme hybridizing to its target 
mRNA. The arrow indicates the position of cleavage in the mRNA. Numbering 
is according to the nomenclature of Heitel et at (26). Unmodified ribonucleotides 
are in bold lower case, 2'-0-alkylated ribonucleotides in plain lower case, and 
deoxynucleotides in upper case. Phosphorothioates are indicated by asterisk, 
while iT denotes an inverted 3-3' thymidine. Bases of the GUM target triplet, 
where M is C or A, are boxed. 



phosphorothioate linkages at the 5'-end (28,29), 3'-end (34) or 
both (27) for stabilization against exonucleases. We included 
two phosphorothioates at the 5'-end and one at the 3'-end. 
DNA nucleotides in the arms have been reported to increase 
catalytic efficiencies of ribozymes in vitro, most likely due to 
increased product dissociation rates (37-39). DNA-armed 
ribozymes may recruit RNase H activity upon hybridization 
with the target RNA and thus enhance their apparent activity 
(40). We are here primarily interested in the accessibility of 
cleavage sites. Anything that reduces the importance of intrinsic 
ribozyme cleavage activity and increases the importance of 
target-specific inhibition of expression is desirable. 

Cellular delivery of ribozymes is commonly accomplished 
with various cationic liposome formulations (16,19,27-29). 
We have used the cationic liposome reagent lipofectamine to 
co-transfect HeLa cells with a mixture of ribozyme, a firefly 
luciferase reporter gene construct containing the complete 
coding cDNA of the target PSKHl, and a Renilla luciferase- 
encoding plasmid serving as an internal transfection control. 
We have succeeded in constructing a ribozyme against PSKHl 
mRNA which reduced the activity of the corresponding 
reporter gene to 20-25% in a concentration dependent manner. 
A correlation is observed with certain features of the predicted 
secondary structure of the target mRNA. 



cleavage sites has been employed to select the most promising 
sites for ribozyme targeting (19). Accessibility assays with 
specific ODNs have also been performed in cell extracts on 
endogenous transcripts in order to better approximate the in vivo 
situation (14,22). The most accessible sites within any target RNA 
may be selected by performing the RNase H-assay with a 
randomized set of ODNs (20,21) or with a target-specific set of 
cDNA fragments prepared by partial DNase I-digestion (23). 
ODN libraries in previous studies (20,21) have been more or 
less completely randomized. We wanted to identify specific 
sites amenable to cleavage with standard hammerhead 
ribozymes, and to restrict the analysis to those triplets that are 
cleaved most efficiently. Since the hanunerhead ribozyme 
seems to prefer a purine in the first position of the cleavage 
triplet and a C or an A in the third position (24), we decided to 
concentrate on the triplets GUC, GUA, AUG and AUA in our 
analysis. Four ODN libraries, each specific for one of these 
cleavage triplets, were synthesized and used to screen in vitro 
transcribed PSKHl RNA for accessible sites. 

Based on a study of the most active ribozymes expressed 
in vivo from a randomized library (25), the ribozymes used in 
the present study were designed to have symmetric 8+8 nt arms 
(Fig. 1) (26). Chemically synthesized ribozymes of similar arm 
lengths have subsequently been successfully employed 
(19,27-29). The stem II structure (Fig. 1) was chosen to be 2bp 
long (30). The activity of such truncated ribozymes in cell 
culture has been demonstrated (27,29). For increased nuclease 
stability of the ribozymes, we have retained unmodified ribo- 
nucleotides in only five catalytically important positions 
(31,32). Deoxyribonucleotides were used in the flanking arms 
(33,34) and 2'-0-alkylated ribonucleotides in the core and 
stem--loop II (31,32,35) (Fig. 1). Nuclease stability of 
ribozymes was increased fiuther by an inverted thymidine at the 
3'-end (28,29,32) and by a hexanol moiety at the extreme 5'-end 
(36). Ribozymes used in vivo frequendy include short stretches of 



MATERIALS AND METHODS 

All restriction enzymes were from New England Biolabs. 
Synthesis and purification of hammerhead ribozymes 

Automated RNA and DNA synthesis was carried out on an Applied 
Biosystems model 394 DNA/RNA synthesizer. 2'-0-Alkyl 
ribozymes containing five unmodified purine ribonucleotides were 
synthesized by solid phase P-cyanoethyl phosphoramidite 
chemistry (41), using the 2'-0-/err-butyldimethylsilyl protection 
strategy for the ribonucleotides (42,43). Syntheses were 
performed on controlled pore glass bearing an inverted thymidine 
linkage (Glen Research). A lipophilic capture tag, l-[Diiso- 
propyl(DL-a-tocopheryloxy)silyl]oxy-6-(2-cyanoethyl 
diisopropylamino-phosphinoxy)hexane, was added at the 5'-end 
of the oligomer as described (36). Cleavage from the support 
and release of all base labile protecting groups (44,45), reverse 
phase HPLC purification (Pharmacia Source 5RPC 10/10 
column, using a flow-rate of 1 ml/min, or a ^iBondapac C-18 
colunrm), lyophilization, desilylation (46), butanol-precipitation 
and counter-ion exchange with NaCl04 were performed essentially 
as described (36). Ribozymes, retaining a 5' hexanol-linker, were 
desalted (NAP- 10 columns, Pharmacia) and quantified by UV 
spectroscopy. Molar extinction coefficients were calculated based 
on the nearest-neighbor method (Biopolymer Calculator at 
http://paris.chem.yale.edu ). Exact molecular weights were 
calculated. Ribozymes were controlled by denaturing 15% 
polyacrylamide gel electrophoresis prior to their application in 
cell culture experiments. 

Sequence and modification of hammerhead ribozymes 

PSKHl-specific 2'-0-methylated ribozymes (MRz) were 
designed targeting a total of eight sites. Allylated ribozymes 
(ARz) targeting a subset of these sites were also synthesized in 
order to evaluate the relative effects of the two types of 
modification on the eflScacy of ribozymes. For both types of 
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ribozymes, controls having the sanie chemical composition 

and length of hybridizing arms were designed targeting an 
irrelevant mRNA (human tissue factor). Ribozymes are named 
after their type of modification and cleavage position. Thus 
MRz-519 is a methylated ribozyme cleaving after guc519, 
while ARz-519 is the corresponding allylated ribozyme. The 
respective control ribozymes are MRz-TF and ARz-TF. The 
unique sequences (flanking arms) of the ribozymes were as 
follows (the conserved sequence of the core is indicated by 
<core> for all but the first ribozyme sequence): MRz-TF/ARz-TF, 
A*A*T-C-T-C-C-T-c-u-g-a-u-g-a-g-g-g-u-u-a-c-c-g-a-a-a-C- 
T-T-A-G-T*G-iT; MRz-118, T*C*G-G-G-A-A-G- <core> -a- 
C-C-T-T-G-C*T-iT; MRz-238, C*C*G-G-C-T-T-T <core> -a- 
C-A-G-G-G-C*C-iT; MRz-287, A*G*T-C-G-G-G-G- <core> 
-a-C-C-G-G-G-G*C-iT; MRz'465/ARz-465, T*G*A-T-G-G- 
C-A- <core> -a-C-G-G-C-T-G*C-iT; MRz'5]9/ARz-5J9, G*C*A- 
G-C-T-C-C- <core> -a-OT-C-A-C-A*C-iT; MRz'539/ARz- 
539, A*C*G-C-A-C-C-C- <core> -a-C-G-C-A-G-C*A-iT; 
MRz-548, G*T*T-G'G-C-A-T- <core> -a-C-G-C-A-C-C*C- 
IT; MRz-m, A*G*A-T"A-C-C G- <core> -a C-G-C-C-A- 
T*C-iT. Modified and unmodified ribonucleotides are in plain 
and bold lower case, respectively, while deoxyribonucleotides 
are in upper case. An asterisk denotes a phosphorothioate 
linkage, while iT indicates an inverted 3'-3' thymidine (Fig. 1). 

SpeciOc and semi-randomized DNA ODNs 

Semi-randomized 13mer antisense ODNs specific for each of the 
target triplets GUC, QUA, AUG and AUA were synthesized and 
HPLC-purified. The ODNs were designed to mimic tfie hybridi- 
zing arms of symmetrically armed (6+6 nt) hammerhead 
ribozymes and had the following sequences (N denotes a 
randomized position): GUC-specific library, (N)^GAC(N)4; 
GUA-specific library, Q^)^kC(^\\ AUC-specific library, 
(N)6GAT(N)4; AUA-specific library, (N)6TAT(N)4. Specific 
ISmer antisense ODNs targeting the selected ribozyme 
cleavage sites were also synthesized. These ODNs are identilled 
by the cleavage position of their corresponding ribozymes, and 
have the following sequences: 0-118, GGGAAGGACCTTG; 
0-238, GGCTFTGACAGGG; 0-287, TCGGGGGACCGGG; 
0-465, ATGGCATACGGCT; 0-519, AGCTCCGACTCAC; 
0-539. GCACCCGACGCAG; 0-548, TGGCATGACGCAC; 
0-712, ATACCGGACGCCA. 

Plasmids 

The cDNA of the coding sequences of PSKHl was previously 
cloned in our lab and inserted into the expression vector 
pcDNA3 (Invitrogen) downstream of a T7 promoter, 
producing the plasmid pcDNA3-PSK. A Renilla luciferase 
expression plasmid, pEFl-Rluc, constructed by inserting the 
EFla promoter into the multiple cloning site of the pRL-null 
vector (Promega), was used as an internal control in co-trans> 
fection experiments. This plasmid was a kind gift from 
Professor A.-B. Kolst0. 

Preparation of FSKHl-luciferase fusion comsltnicts 

The full-length coding sequence of PSKHl cDNA was cloned 
in-frame with fu-efly luciferase cDNA into the BglWNcol sites 
of the pGL3 Enhancer expression vector (Promega) as a 
BamiSr^col PGR fragment, producing the plasmid pPSK-Luc. 
The PSK-Luc fusion was cloned into the tetracycline response 
plasmid pTRE (Qontech) in two steps. A Kprd-BamHl fragment 



of pPSK-Luc was first transferred to the EGFP- 1 vector (Clontech). 
Subsequently, an EcoRl-BamUl fragment from this clone was 
excised and cloned into the same sites of the pTRE vector, 
giving pTRE-PSK-Luc. Finally, PSK-Luc cDNA was also 
cloned into the pcDNA3 expression vector as an EcoRl-Xbal 
fragment for higher-level expression. This expression plasmid 
was termed pcDNA3-PSK-Luc. 

In vitro transcription of PSKHl RNA 

For in vitro transcription of PSKHl RNA with T7 RNA 
polymerase, the pcDNA3-PSK plasmid was linearized intern- 
ally with EcoNly producing a 1.03 kb transcript containing 
47 nt of 5' vector-derived sequence, or downstream with Xbal 
(producing a 1.37 kb transcript). Protein and salt were removed 
from restriction reactions using JetQuick columns (Genomed). 
Purified DNA was eluted in DEPC-treated water (DEPC-HjO), 
precipitated with ethanol, and resuspended in DEPC-H2O. Run- 
off transcription of uniformly ^^P-labeled RNA was performed in 
50 nl reactions containing 2 ^ig template DNA, 0.5 mM each of 
GTP, ATP, CTP and UTP (Boehringer Mannheim), 50 U RNasin 
(Promega), 1-2 |il 10 \iCi/\i\ [a-^^pj^^TP or rUTP (Amersham) 
and 50 U T7 RNA polymerase (New England Biolabs) in RNA 
polymerase buffer (40 mM Tris-HCl, 6 mM MgClj, 2 mM 
spermidine, 10 mM DTT, pH 7.9). Reactions were incubated 
for 2 h at 37°C. RNA was then desalted by centrifugation 
through RNase-free G50 Sephadex QuickSpin Columns 
(Boehringer Mannheim) and deproteinated by phenol extraction. 
RNA was then precipitated with isopropanol and resuspended 
in RNase-fi-ee water. Alternatively, RNA was purified by LiCl- 
precipitation (2.5 M final concentration) following transcription. 
Yield and concentration of RNA were calculated from the 
percentage of incorporated radioactivity. 

Antisense ODN and RNase H-mednated in vitro cleavage of 

PSKHl RNA 

RNase H-mediated cleavage assays with antisense ODN 
libraries were performed in 10 ^1 reactions containing 40 (JM 
ODN library, 50 nM PSKHl mRNA and 0.25 U RNase H 
(Promega) in a buffer containing 40 mM Tris-HCl pH 7.5, 
10 mM MgCl2 and 10 mM DTT. A stock dilution of RNA in 
buffer was heated to 95°C for 60 s and then preincubated at 
3TC for 15 min before adding enzyme. An 8.0 \i\ mixture of 
RNA, buffer and enzyme was then mixed with 2.0 \il 200 ^M 
ODN library, yielding the indicated final concentrations of 
components. Reactions were incubated for 30 min at 37'^C, and 
quenched on ice with 10 |il of denaturing loading buffer (8 M 
urea, 20 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
cyanol). Samples were heated to 95°C for 2 min prior to 
loading onto a 4% denaturing polyacrylamide gel. RNA size 
markers were prepared by run-off in vitro transcription of 
differently linearized (A^ct?I, Xmnl, BsaAl, Nael, BstUl, Fokl 
and DdeJ) pcDNA3 templates. Samples and standards were 
analyzed by electrophoresis for 3-4 h at 50 W at room temperature 
on 40 cm sequencing gels. The gels were then transferred to 
Whatman 3MM paper, wrapped in plastic foil and exposed 
overnight in a Phosphor Screen (Molecular Dynamics) prior to 
analysis on a radioanalytical scanner (Storm 860, Molecular 
Dynamics). Fragment sizes were calculated from their measured 
mobilities and those of the size markers. Reactions widi specific 
antisense ODNs (200 nM, 4-fold molar excess) were performed 
essentially as for the ODN libraries. 
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Cultured cells 

HeLa cells (from ATCC) were maintained in DuIbecco*s 
Minimal Essential Medium supplemented with 2 mM 
glutamine and 10% fetal calf serum (all reagents from Gibco 
BRL). Upon thawing, the cells were grown to near confluency 
for 2-3 days and passaged at least once before they were used 
for experiments. Cells were routinely passaged every 3-4 days. 

IVansient co-transfections 

Cells were plated in 1 2- well (Costar) or 24- well (Sarstedt) 
plates at 3{M0% confluency and transfected at an estimated 
60-80% confluency the following day. Transfections were 
performed with 100 nM ribozyme (1.23-1.30 (ig/ml), 0.40|ig/ml 
reporter construct (pTRE-PSK-Luc or pcDNA3-PSK-Luc) and 
8 ng/ml internal control plasmid (pEFl-Rluc). Nucleic acids 
were complexed with lipofectamine (Gibco BRL) (a final 
concentration of 8.2-8.5 ^ig/ml lipid) at a constant 1:5 w/w 
ratio, following optimization with the transfection agent. 
Complexes were formed by mixing equal volumes of medium- 
diluted nucleic acids (ribozyme + DNA) and lipofectamine and 
incubating at room temperature for 30 min. The complexes 
were subsequendy diluted to the final volume with serum-free 
medium and added to pre- washed cells (250 and 500 \i\ for 16 
and 22 mm wells, respectively) for 5 h. The complexes were 
subsequendy replaced with fiill medium. 

Uptake of FITC-labeled ribozymes 

Uptake of FITC-Iabeled ribozymes under the standard trans- 
fection conditions was determined by fluorescence spectro- 
scopy. Lipofectamine complexes were diluted with medium 
and chilli on ice prior to addition to cells (in 12- well plates) 
preincubated (for at least 30 min) either at 37°C or on ice. 
Following transfection for 5 h, cells were harvested by 
washing three times with ice cold phosphate-buffered saline 
(PBS) and lysed with 1.5 ml PBS-TDS (1% Triton, 0.5% 
deoxycholate and 1% SDS in PBS) for at least 15 min at room 
temperature under vigorous shaking to achieve complete lysis. 
Parallels were combined and fluorescence recorded on a 
spectrometer (LS-5, Perkin Elmer) at excitation and emission 
wavelengths of 492 and 522 nm. respectively. 



Luciferase activity assays 

Cells were harvested 24 h after initiation of transfection and 
washed twice in cold PBS prior to lysis for 30 min on ice with 
a passive lysis buffer supplied with the Dual-Luciferase® 
Reporter Assay System kit (Promega). Following brief centri- 
fugation to pellet cell debris, luciferase assays were performed 
in white non-transparent 96-weil plates (Nunc) using a plate- 
reading luminometer (MicroLumat Plus, EG&G Berthold) 
equipped with two injectors. Dual-luciferase assays were 
performed on 25 \l\ lysate supernatant. Firefly and Renilla 
luciferase activities were recorded following the respective 
injections of 100 [il LAR 11 and 100 nl Stop&Glow reagents of 
the assay system according to the manufacturer's instructions. 
The instrumental background levels of luminescence were 
recorded for empty wells. 

RESULTS 

Selection of rilK>zyme target sites by MFold secondary 
structure prediction 

We have used the MFold program (version 3.0 at http:// 
mfold2.wustl.edu ) (7,8) to predict the secondary structure of 
PSKHl mRNA as a means of selecting GUC sites for 
ribozyme targeting. The MFold program generated many 
suboptimal secondary structures that differed minimally in 
energy from the optimal folding. Although these secondary 
structures displayed varying degrees of folding differences, 
certain substructures could be identified that demonstrated a 
considerable degree of conservation. High conservation of 
these substructures among alternate suboptimal structures may 
increase their likelihood of being 'tme' structures (i.e. contained 
in the actual secondary structure of the mRNA in vivo). Therefore, 
in selecting GUC sites for hammerhead ribozyme targeting, we 
have screened 10 optimal and suboptimal secondary structures 
of the first 1 .0 kb of the PSKHl transcript, as transcribed from 
pTRE-PSK-Luc, for recurring substructures. All selected 
target sites were located within identical substructures in at 
least six out of die 10 most energetically stable structures, 
including the optimal structure (Table 1). For some target sites, 
several of the suboptimal structures diverged only slightfy 
from the consensus in ways that did not significantly change 
the basic characteristics of the substructure. Due to the lack of 



Table 1. Features of the predicted secondary structure of selected ribozyme target sites 





gucliS 


guc238 


guc287 


gua465 


guc5l9 


guc539 


guc548 


guc712 


Structural frequency 


6/10 


6/10 


6/10 


6/10 


6/10 


7/10 


7/10 
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Major loop 


5 
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3 


0 


3 


3 


Major stem 


6 


8 


3 


4 


6 


17 


10 


11 


AG (helix)" 


-12.7 


-20.1 


-5.7 


-8.8 


-8.1 


-n.4 


-9.7 


-13.1 


AG (target)'' 


-13.2 


-19.4 


-6.7 


-9.4 


-14.4 


-18.1 


-15.4 


-8.3 


AG(duplex)<= 


-32.9 


-36.0 


-40.2 


-35.2 


-34,1 


-36.2 


-34.0 


-31.7 



Structural frequency denotes the number of times the target site substructure occurred unchanged in the 10 energetically most 
stabilized foldings of the mRNA. Major loop and stem are the longest stretches of predicted unpaired and paired target sequence 
bases, respectively. Free energies (AG) were calculated for the most stable helical region within the target sequence*, for the target 
sequence as a whole"*, and for the hammerhead-RNA duplex^, as detailed in the text. Energies arc in kcal/mol. 
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Figure 2. Excerpts of the energetically optima] secondary structure of the first 1000 bases of the pTRE-PSK-Luc transcript, as predicted by MFold version 3.0 (7,8). 
Target sequences of the ribozymes are highlighted. Arrows indicate position of cleavage. 



a consensus on what kind of structures make the best targets, 
we have chosen GUC targets (guc238, guc287, guc519 and 
guc712) that are predicted to be located within very different 
substructures (Fig. 2). Hie guc238 cleavage triplet is located 
within a large internally looped stem in which the 5' arm 
(stem I) of the corresponding ribozyme is complementary to 
die nucleotides of the loop. The guc287 target sequence, on the 
other hand, is predicted to fold into a short stem-loop structure. 
The guc5 19 target site is located at the end of what is essentially a 
very long basepaired region that is interrupted by occasional 
short bulges. A ribozyme was also designed targeting guc712, 
located at a particularly stable hairpin structure at the end of a 
very long, and presumably very stable, stem. This target site 
was included due to the high conservation of its secondary 
structure among suboptimal structures of the target RNA 
(identical in 10 out of 10 and 19 out of the 20 energetically 
most favorable structures). In addition to the above target sites, 



the first GUC site in the RNA (gucl 18) was selected due to its 
close proximity to the translation initiation region (only -20 nt 
downstream of the initiation codon). This region has often 
been targeted (12,33,47,48) because the RNA in this region 
might be relatively open due to the need for binding of the 
components of the translation machinery. The predicted 
secondary structure for the gucl 18 target consists of a stem 
with two internal loops (Fig. 2). 

In vitro accessibility assays with semi-randomized ODN 
libraries and specific ODNs 

Target triplet-specific, semi-randomized ODN libraries were 
used to screen in vitro transcribed PSKHl RNA for hybridization- 
accessible GUC, GUA, AUC and AUA sites. Accessible sites 
were identified by RNase H-mediated cleavage of the RNA at 
the RNA-DNA hybrids generated by hybridization of anti- 
sense ODNs to target RNA (19-21). Screening of a 1.03 kb 
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Figure 3. In vitro accessibility assays. (A) Separate screening of a 1 .03 kb 
PSKHI in vitro transcript with triplet-specific ODN libraries. Reactions were 
peiformed with 50 nM RNA and 40 fiM ODN libraiy as detailed in Materials 
and Methods. Arrows indicate pairs of fragments resulting from cleavage ai 
subsequently selected ribozyme target sites. (B) Cleavage of PSKHI RNA 
(SO nM) with molar excess (200 nM) of specific ODNs against ribozyme 
target sites selected by oligo library screening (0-465, 0-5 19» 0-539, 0-548) 
and structural considerations. 



PSKHI transcript from the pcDNA3-PSK plasmid was 
performed separately with each of the four ODN libraries 
(Fig. 3 A). Since the transcript was uniformly labeled, cleavage at 
any position produced two cleavage products. This complicated 
the interpretation of the data since each pair of cleavage fragments 
suggested two possible sites of cleavage. Screening reactions 
were subsequently performed on a longer RNA transcript 
(1.37 kb) that differed from the first transcript in its 3'-end. 
This facilitated the interpretation of cleavage data, as 5' cleavage 
fragments were identical for the two RNAs. Screening of two 
differently sized transcripts increased confidence in the results. 
Any accessible sites that are not identified in the longer tran- 
script are likely to be due to structural features of the specific 
transcript that may not be represented in the full-length 
mRNA. Such sites are more likely to be encountered at the 
ends of the transcripts since the local folding of these regions will 
be most strongly affected by the lack or presence of additional 
sequences. Consequently, any cleavage sites originating within 
the first or last 200 nt of the transcript were ignored in the 



selection of accessible sites by library screening, as were cleavage 
sites that could not be detected in the longer transcript. 

The screening assays demonstrated several reproducible 
cleavage sites that obeyed the above restrictions. Most 
cleavage sites were identified with the GUA and GUC 
libraries, while no useful sites were identified with the AUA 
library (Fig. 3 A). Screening of PSKHI RNA with the GUA 
library produced two very prominent cleavage fragments that 
were not produced by any of the other libraries, demonstrating 
their GUA triplet specificity. The sizes of these two fragments 
were estimated at -615 and 405 nt. Their combined estimated 
size of 1020 nt was in good agreement with the transcript 
length (which was 1030 nt). The shorter fragment was shown 
to be the 5' cleavage fragment, as it was also present when 
screening the longer RNA transcript (data not shown). The size 
of this fragment was consistent with cleavage near a site 
corresponding to position 460 in the PSKHI cDNA. Examination 
of the sequence indicated a suitable GUA triplet at position 
463-465 as the only probable site of cleavage. Screening with 
the GUC library produced six closely spaced fragments with 
estimated sizes ranging from 470 to 560 nt which fitted nicely 
into three pairs with combined sizes of the expected length 
(1020-1030 nt). By a similar analysis as for the GUA library 
cleavage fragments, the putative cleavage sites were identified 
as guc519, guc539 and guc548, respectively. One of these 
sites, guc519, had previously also been selected on the basis of 
MFold secondary structure predictions (see above) prior to the 
antisense library screening. The AUC library screening also 
produced several fragment pairs, but these were either weaker 
than their GUA and GUC counterparts, or could not be unam- 
biguously assigned to a specific triplet. The screening assays 
were performed with a ratio of ODN (40 |iM) to target RNA 
(50 nM) of 800:1, which given the complexity (4*<>= 1 x 10^) 
of the libraries corresponds to a ratio of specific ODN 
(40 pmol) to RNA of roughly 1 : 1250. This might seem too low 
a ratio to reasonably account for the degree of cleavage 
observed. However, the observed cleavage would correspond 
to a turnover of only -100 molecules/h. Furthermore, the 
concentration of ODNs that can productively hybridize to a 
given target site may be significantly higher as mismatches 
corresponding to the ends of the ODNs are not expected to 
severely impair hybridization efficiency. Non-triplet-specific 
cleavage events (involving ODNs with mismatched or wobble- 
paired triplets, as well as partial hybridization of only the 
random-armed portions) may also conceivably occur, although 
they are less likely to account for the major cleavage events. 
Non-specific cleavage events would also be expected to result 
from screening with more than just one library. Examples of 
such cleavage events were observed, but did not include the 
selected cleavage sites. Nonetheless, to ensure that putative 
ribozyme target sites were properly identified, the provisional 
identifications were subsequently confirmed by cleavage of the 
RNA with the corresponding specific antisense ODNs. In all 
cases, the cleavage fragments produced by the specific ODNs 
co-migrated with the corresponding cleavage fragments 
produced by the library screenings (data not shown). These 
results demonstrate the utility of triplet-specific semi-randomized 
ODN libraries in identifying potential target sites. 

In vitro cleavage assays were performed with specific ODNs 
against all sites targeted by ribozymes to investigate whether 
the sites that were selected by library screening were more 



Nucleic Acids Research, 2000, Vol, 28, No, 21 4119 



accessible than target sites selected by alternative means. The 
specific ODNs, like those of the semi-randomized libraries, 
were designed to have the same recognition sequences as a 
corresponding hypothetical 6+6 armed hammerhead ribozyme. 
The in vitro cleavage assays demonstrated that all the ODNs 
targeting sites that were selected by library screening resulted 
in stronger cleavage than the best among the ODNs targeting 
sites selected by theoretical/structural considerations (Fig. 3B). 
Furthermore, the strongest cleavage was achieved with the 
ODN (0-465) corresponding to the GUA site that was cleaved 
most strongly in the library screenings (Fig. 3 A). Thus accessibility 
data obtained from library screening assays were representative 
for die hybridization efficiencies of the corresponding unique 
ODNs. Consequently, major cleavage events do not appear to 
be substantially influenced by the presence of non-specific 
ODNs in the library screening. This suggests that neither 
positive nor negative cooperativity of binding (49) occurs to a 
significant degree under our assay conditions. 

Cellular upttake of ribozyme 

FITC-labeled control ribozymes were used to investigate the 
cellular uptake of ribozyme under the optimized co-transfection 
conditions by spectroscopy. In order to distinguish between 
intracellular uptake and membrane association by spectro- 
scopic analysis, uptake experiments were performed in parallel 
by incubating cells on ice as well as at Similar amounts 
of MRz-FITC and ARz-FITC ribozymes were found to be 
associated with cells following incubation both on ice (26-28% of 
total added ribozyme) and at ZTC (58-59% of added 
ribozyme) (data not shown). The increase in cell-associated 
fluorescence at the higher temperature, suggests that in excess of 
30% of ribozyme has been internalized. The cellular association 
of ribozyme following incubation on ice was shown to have a linear 
dependency on ribozyme concentration in the range 0-100 nM 
(data not shown). This is consistent with a state of equilibrium 
between membrane-bound and free (in medium) ribozyme. 
Assuming that this equililxium is maintained upon internalization 
of ribozyme at the permissive temperature, the amount of inter- 
nalized ribozyme is determined by the formula / = 71(P-a)/(r-a), 
where T is the total amount of ribozyme added to cells, and a 
and P are the measured cell-associations upon incubation on 
ice and at 37^C, respectively. Net uptake of ribozyme under the 
given transfection conditions was estimated at 42-45% (data 
not shown). Co-transfection optimization experiments 
performed at various w/w ratios ( 1 : 1 to 6: 1 ) of lipofectamine to 
total nucleic acids demonstrated that uptake was only slightly 
influenced by lipofectamine concentration above a certain 
threshold (2:1 w/w ratio) that constitutes a molar excess of 
positive charges (from cationic liposomes) in the complexation 
mixture. Below this threshold (at a 1:1 w/w ratio, yielding 
complexes of net negative charge), uptake was severely 
impaired (data not shown). 

Efficacy of ribozymes in co-transfection experiments in 
HeLa cells 

The in vivo efficiency of the selected ribozymes was evaluated in 
a cell culture assay in which 1(K) nM of ribozyme was co-trans- 
fected with a plasmid coding for a PSKHl-luciferase fusion 
protein. In order to correlate for transfection variability and 
improve experimental reproducibility, a plasmid coding for 
Renilla luciferase (Rluc) under the control of an EFla-promot^- 



was added to the transfection mixture as an internal control. 

This control proved to be essential as the general expression 
levels of Renilla luciferase varied up to 2-3-fold within the 
same experiment for different ribozymes (data not shown). 
Experiments were performed as far as possible with the full 
complement of ribozymes of identical chemistry so that in 
each experiment, the efficacies of ribozymes were compared 
under identical conditions. In each experiment, the relative 
firefly luciferase activity for all ribozymes was normalized to 
the levels for the control ribozyme (the expression of which 
was set at 100%). Normalization was always performed 
relative to the relevant control ribozyme. The data from experi- 
ments with both methylated and allylated ribozymes are 
summarized in Figure 4. The most effective methylated 
ribozyme in co-transfection assays proved to be MRz-287, 
targeting guc287. This ribozyme reduced the level (normalized) 
of luciferase expressed from the reporter gene to -23% of the 
control levels, i.e. the levels in cells treated with irrelevant 
control ribozyme. The second most efficient ribozyme was 
MRz-519, which resulted in -65% inhibition of expression. 
The ribozymes targeting sites gucllS and guc548, exhibited 
similar inhibitory effects, with residual expression levels just 
over 40%. At the other end of the spectrum we find the 
ribozymes targeting guc712 and guc539, which result in only 
-30% inhibition. 

Three of the ribozymes were synthesized also in the allylated 
version to evaluate any systematic differences in inhibitory 
capacity of methylated and allylated ribozymes. The effect of 
the allylated ribozymes varied over a relatively wide range, 
resulting in residual reporter gene expression ranging from 
-35% for ARz-519 to 95% for ARz-539 (Fig. 4). Comparing 
the data for these ribozymes with those of their methylated 
counterparts, we observed the same activity ranking. For both 
types of modification a ribozyme targeting the guc519 
cleavage site was superior to the ribozymes targeting the two 
other sites, gua465 and guc539, of which the latter site 
appeared to be least amenable to cleavage. This constitutes 
evidence that the observed inhibitory effects of these 
ribozymes are sequence specific. The difference in activity of 
methylated and allylated versions of otherwise identical 
ribozymes varies with the target of the ribozymes. While the 
two types of ribozymes targeting guc5 19 are equally active, the 
allylated ribozymes targeting gua465 and guc539 are less 
active dian their methylated counterparts. The combined data 
suggest that methylation generally results in superior 
ribozymes and, further, that the allyl modification seems to be 
more detrimental to ribozyme efficacy when targeting poorly 
accessible sites. Reduced efficacy of allylated ribozymes is not 
due to differences in intracellular uptake, as the methylated and 
the allylated FITC-labeled ribozyme were internalized to the 
same level by lipofectamine-mediated transfection (see uptake 
experiments). 

Ribozyme effects depend on ribozyme concentration and 
target gene promoter strength 

The concentration dependence of the ribozyme effect was 
investigated in co-transfection experiments in which the 
concentration of the active anti-PSKHl ribozyme was varied 
and the total concentration of ribozyme adjusted to 100 nM 
with control ribozyme. For dose-dependence experiments, the 
most active allylated (ARz-519) and methylated (MRz-287) 
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Figure 4. Lipofectamine-mediated co-transfection of HeLa ceils with ribozyme, pTRE-PSK-LAic, and pRLuc as detailed in Materials and Methods. Eight methylated 
(MRz-) and three allylaled (ARz-) PSKH 1 -specific ribozymes were analyzed together with their respective controls (MRz-TF and ARz-TF). PSKHl -dependent firefly 
luciferase expression was normalized to Renilla luciferase expression for each sample. Normalized expression in cells transfected with control ribozymes within 
each series was set at 100%. Data are averages of 4-7 independent experiments. Expression levels are indicated above the error bars (+SD) of each column. 



ribozymes were selected. These experiments confirmed the 
dose-dependence of the ribozyme effect (Fig. 5A). We next 
wanted to investigate whether increasing the steady-state target 
gene expression would reduce the specific inhibitory effect of 
ribozymes. Higher expression of the reporter was achieved by 
placing it under the transcriptional control of the CMV 
promoter in the pcDNA3 expression plasmid. This resulted in 
20-fold enhancement of reporter expression (data not shown) 
compared to the weaker tetracycline-responsive promoter. In 
co-transfection experiments, this increase in reporter expression 
was accompanied by reduced apparent efficiencies of three of 
the most active PSKHI ribozymes (Fig. 5B). The average 
levels of inhibition of target gene expression with the weak 
promoter were 73, 63 and 51% for ribozymes MR2-287, MRz-519 
and MRz-548, respectively. With the strong promoter, yielding 
the 20-fold higher expression of reporter (measured in control 
ribozyme-treated cells), their inhibitory activities were 60, 43 
and 33%, respectively. Thus the inhibitory effect of ribozymes, 
as expected, was dependent on the concentrations of both 
ribozyme and target, but the ribozymes were able to inhibit 
almost completely the 20-fold increase in the target mRNA. 

Comparing ribozymes selected by MFold secondary 
structure prediction and in vitro accessibility assays 

Ribozymes were ranked based on their inhibitory activity in 
co-transfection assays, with tied ranks given to ribozymes 



resulting in residual expression levels differing by <3% 
(Table 2). This classification can be used to assess the relative 
merits of the two methods of target site selection. The four 
ribozymes that were selected on the basis of library screening 
data (MRz-465, MRz-519, MRz-539. MRz-548) were compared 
to those selected on the basis of structural considerations 
(MRz-118, MRz-238, MRz-287, MRz-519, MRz-712) by 
Wilcoxon's rank sum test on unpaired samples (50). The rank sums 
for the two groups of ribozymes were 18 (average rank = 4.5) and 
20 (average rank = 5.0), respectively, which are higher than the 
critical rank sum of 1 1 required for a 5% significance level. 
From these data, ribozyme targets selected on the basis of 
in vitro accessibility are no more susceptible to ribozyme 
cleavage in vivo than those selected by structural prediction or 
other considerations. 

Correlation of ribozyme efficacy with the predicted target 
secondary structure 

The two methods of target selection both resulted in ribozymes 
with very diverse inhibitory activities. We therefore decided to 
investigate whether there was any correlation between 
predicted structure of target sites and the in vivo efficacy of 
corresponding ribozymes. We approached this by attempting 
to break down the secondary structures of the target sequences 
in readily quantifiable parameters. In doing so we have 
disregarded certain factors that may influence hybridization 
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Table 2. Rank correlations of ribozyme efficacy with various features of the predicted target site secondary structure 



Ribozyme 


Efficacy 


AC (helix) 


Stem 


Siem+helix 


Loop 


AC (target) 


AC (duplex) 


MRz-287 


1.0 


1.0 


1.0 


1.0 


4.0 


1.0 


1.0 


MRz-519 


2.0 


2.0 


3.5 


2.8 


6.0 


5.0 


5.0 


MRZ-M8 


3.5 (3.0) 


6.0 


3.5 


4.8 


3.0 


4.0 


7.0 


MRz-548 


3.5 (4.0) 


4.0 


6.0 


5.0 


6.0 


6.0 


6.0 


MRz-465 


5.0 


3.0 


2,0 


2.5 


2.0 


3.0 


4.0 


MRz.238 


6.0 


8.0 


5.0 


6.5 


l.O 


8.0 


3.0 


MRz-712 


7.5 (7.0) 


7.0 


7.0 


7.0 


6.0 


2.0 


8.0 


MRz-539 


7.5 (8.0) 


5.0 


8.0 


6.5 


8.0 


7.0 


2.0 


Pearson's correlation 




0.75 (0.69) 


0.77 (0.80) 


0.84 (0.83) 


0.16 


0.35 


0.18 


coefficient (r) 
















Significance 




0.025 (<0.05) 


<0.025 


0.01 


>0.05 


>0.05 


>0.05 


level {P) 

















Ribozymes were ranked according to inhibitory activity in co-transfection assays (lowest rank for highest activity). Target sites were ranked according to free 
energy of linfiiting helix and target sequence (lowest rank for highest energy), stem length (lowest rank for shortest stems), loop length (lowest rank for largest 
loops), and the free energy of the ribozyme-RNA duplex (lowest rank for lowest enegy) (Table 1). Tied ranks were allotted to targets with the same length of stem 
and loop and to ribozymes resulting in residual expression levels differing by at most 3%. When considering stem and helix together, the average of the two 
individual ranks was used. Correlation coefficients have also been calculated for the case in which ribozymes were ranked without ties (numbers in parentheses). 
This did not significantly alter the results. 



efficiency but which cannot be easily quantified. Such factors 
include the positioning of single-stranded stretches within the 
target sequence and the ability of a specific hammerhead 
sequence to assume its proper catalytic three-dimensional 
structure. The presence of so-called free ends, the positioning 
of single-stranded stretches at the ends of target sequences, 
may be expected to be correlated with enhanced antisense 
binding (12), while elements of strong secondary structure in 
the neighborhood of the target sequence proper might impede 
ribozyme folding. We do not imply that these factors are 
unimportant, merely that as they cannot be easily quantified, 
other more readily quantifiable parameters should be investi- 
gated for predictive value. Three potentially important para- 
meters for target site accessibility were local free energy of 
folding of the target sequence (11), the size of single-stranded 
stretches (loops) that may function as * hooks* for nucleation of 
duplex formation (51,52), and the length and stability of stems 
and helices. Stems and helices may need to be opened up for 
full hybridization of ribozyme to occur and their lengdi and 
stability may therefore influence hybridization efficiency. 
Consequently, the target sequences of all ribozymes were 
decomposed into length of the major loop, stem and helix, 
while free energies have been calculated both for the target 
sequence as a whole and for its major helical region (Table I). 
Target sequence free energies were calculated for the energeti- 
cally optimal folding of the RNA, by adding up the energy 
contributions from all base-pairings, stacking interactions, 
bulges and various loops that are contained within the target 
sequences, as indicated by the MFold program. MFold was 
also used to determine the free energy of the most stable 
(limiting*) helical region and to estimate the strength of the 
ribozyme-target hybrid (by folding the coiresponding c/>acting 
ribozyme in which catalytic and substrate strands were 
connected through a 5 nt loop at stem I). 



Spearman's rank correlation test (53) was used to investigate 
the level of correlation of ribozyme efficacy ranking with 
various features of the predicted secondary structure of the 
target sequences (Table 2). No correlation was observed with 
the length of the major loop (Pearson's correlation coefficient 
r = 0. 1 6) or ribozy me-substrate duplex free energy (r = 0. 1 8). The 
correlation with target sequence fiie energy was weak (r = 0.35) 
and not significant. Ribozyme efficacy was, however, significantly 
correlated (P <0.025) with both the length of the major base- 
paired stretch (r = 0.75) and the energy of the most stable helix 
(r= 0.77) within the target sequence (Table 2). Correlation was 
improved (r = 0.84) when considering stem length and helix 
stability together (for this analysis, the rank was taken as the 
average of the two individual ranks for stem and helix). Corre- 
lation coefficients were not significantly affected by ranking 
ribozyme activity without the use of ties (Table 2). 

DISCUSSION 

In this study we have analyzed DNA-armed chemically 
modified hammerhead ribozymes targeting eight GUC and GUA 
sites selected by two different methods, in vitro accessibility 
assays and MFold prediction. In a co-transfection controlled 
assay, the ribozymes resulted in residual luciferase reporter 
gene expression ranging from 23 (MRz-287) to 72% (MRz-539) 
(Fig. 4). This activity may in part be due to the presence of 
DNA arms which allow RNase H-mediated cleavage of DNA- 
RNA hybrids. Target gene expression normalized to the 
expression of a co-transfected non-target gene allowed the control 
of any non-target-specific sequence effects of the ribozymes. 
Non-specific effects are occasionally encountered in association 
with extended stretches of phosphorothioate (P=S) linkages 
(19). In an attempt to minimize such effects, no more than two 
consecutive I^S linkages were incorporated. Finally, 
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Figure 5. Dependence of ribozyme inhibilwy activity on ribozyme concentration 
and reporter gene expression. (A) Transfecdon of HeLa cells with increasing 
concentrations of ARz-519 or MRz-287 ribozymes. Total concentration of 
ribozyme is adjusted to 100 nM with control ribozyme (ARz>TF or MRz-TF) 
and transactions performed after the standard protocol. Data for representative 
experiments are shown. (B) Parallel co-transfections of HeLa cells with 
selected ribozymes and different reporter constructs (pTRE-PSK-Luc and 
pcDNA3-PSK-Luc). Relative expression levels in control cells were 20-fold 
higher with the pcDNA3-PSK-Luc construct compared to pTRE-PSK-Luc. 



ribozymes of different chemistry (with methylated RNA 
instead of DNA in the arms) against two of the targets (guc519 
and gucS48) were less efficient in inhibiting target gene 
expression than their DNA-armed counterparts (unpublished 
data). In combination, the above data make it unlikely that the 
observed effects are due to aspects of ribozyme sequence or 
chemistry unrelated to hybridization-specific activity. 

The inhibitory effect of ribozymes was dependent on both 
the ribozyme concentration (Fig. 5A) and the steady-state 
expression levels of the reporter gene (Fig. 5B). A reduced 
inhibitory effect was observed for three selected methylated 
ribozymes when expressing the reporter gene from a stronger 
promoter resulting in 20-fold higher expression levels 
(Fig. 5B). However, the molar amount of target suppression 
when the stronger promoter was used increased to a degree 
almost matching the increase in the level of the exogenous 
target. This interesting fact suggests an enzymatic rather than a 
stoichiometric effect of the ribozymes on target gene expression. 
The enzymatic function may derive entirely from the ribozyme 
or include RNase H activity. It should, however, be noted that 
the increase in the total level of PSKHl mRNA by expressing 



the exogenous transcript from the stronger promoter depends 

also on the unknown level of the endogenous transcript, which 
is another target for the ribozyme. The level of suppression 
achieved with the best ribozyme in this study (77%) is similar 
to that which has previously been reported for the best of 
several unmodified ribozymes in a similar luciferase reporter 
gene co-transfection assay (16). Other comparable studies 
have reported inhibition levels in the range of 40-80% for a set 
of 15 ribozymes (54), 50% inhibition with a pair of variously 
modified ribozymes (29), and 40-55% inhibition for a pair of 
2'-F-pyrimidine modified ribozymes with phosphorothioates 
at both ends (27). Thus our results, achieved with generally 
more extensively modified ribozymes, compare favorably with 
previous reports. 

We have also attempted to evaluate the relative effects of 
two commonly employed types of 2'-0-modifications on the 
activity of ribozymes. The same rank order of ribozymes 
targeting three selected sites was observed for both methylated 
and allylated DNA-armed ribozymes (Fig. 4). Allylated 
ribozymes were generally less efficient in inhibiting reporter gene 
expression in co-transfection assays compared to their methylated 
counterparts. The difference in activity of methylated and 
allylated versions of otherwise identical ribozymes appeared to 
correlate with the susceptibility of the target to inhibition by 
ribozyme. While the ribozyme targeting the least accessible of 
the dually target sites (guc539) apparently was most sensitive 
to the type of alkylation, ribozyme species of either modification 
were equally efficient when targeting die most susceptible site 
(guc5i9). 

Predicted secondary structures of targets selected on the 
basis of the MFold program were very diverse and included a 
short stem-loop (guc287), a stem with a large internal loop 
(guc238), a bulged stem (guc519), and a hairpin structure 
(guc712), as well as a site near the translation initiation site that 
was presumed to be relatively unstmctured (Fig. 2). In vitro 
accessibility assays with cleavage triplet-specific ODN-libraries 
identified three accessible GUC sites and one GUA site 
(Fig. 3A). In vitro cleavage assays with specific ODNs against 
all sites targeted by ribozymes confirmed that the sites that 
were selected by library screening were indeed more accessible 
in vitro than sites selected by alternative means (Fig. 3B). This 
confirms the utility of such semi-randomized libraries for 
identifying the most accessible sites in vitro. However, 
ribozymes targeting sites selected on the basis of in vitro 
accessibility assays were no more efficient in inhibiting target 
gene expression in a co-transfection assay than ribozymes 
targeting sites selected by theoretical means. In fact, the 
ribozyme targeting the most accessible site in vitro (MRz-465) 
ranked only as the fifth most active (Table 2), while the target 
site of the best ribozyme (MRz-287), was relatively inaccessible 
in vitro (Fig. 3B). The lack of correlation between in vitro 
accessibility and in vivo efficacy data is consistent with 
previous observations of other researchers (55-58). A possible 
explanation for the poor correlation is that the target mRNA is 
folded differently in vitro than in vivo. The secondary structure 
of the folded RNA may not be the energetically most stable 
(fast local folding events may prevent more energetically favorable 
interactions between distal regions). Structural features that 
promote hybridization in vivo and in vitro may also differ. 
Furthermore, the hybridization efficiency of 13mer antisense 
ODNs may not be entirely representative for the hybridization 
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characteristics of longer (32mer) hammerhead ribozymes with 
varying degrees of secondary structure of their own. Finally, 
the generation of higher-order mRNA structures and modulation 
or masking of the mRNA secondary structure by RNA-binding 
proteins (9,10) or protein complexes (ribosomes) (59) in vivo 
may influence accessibility of target sites, although these 
factors would also tend to rule out the applicability of 
secondary structure predictions for target site selection. Recent 
data suggest that performing in vitro accessibility assays on an 
endogenous transcript in a protein environment (cell extracts) 
may improve the accuracy of the predictions (14,22). 

Although in vitro accessibility assays proved to be of limited 
predictive value for the in vivo situation, correlative studies 
suggested that secondary structure predictions might have 
some merit. Significant correlation was found between 
ribozyme efficacy and the presence of short stems and energetically 
unstable helices within the ribozyme target sequence (Table 2). 
Ranking of ribozymes according to these two criteria, the 
relative efficacies of ribozymes were predicted nearly perfectly, 
the only significant discrepancy being the ribozyme targeting 
gua465. As well as being most accessible in vitro, this site also 
has a secondary structure that according to the above criteria 
should make it a significantly better target site than observed 
here. Other factors, such as the lack of single-stranded bases near 
the ends of the target sequence or a prohibitive environment for 
ribozyme folding, may explain the results. Notwithstanding 
this discrepancy, there is an apparently clear correlation 
between ribozyme efficacy and predicted target sequence 
secondary structure. Furthermore, the combined data from this 
study suggest that the previously reported correlation of 
ribozyme efficacy with local folding potential (11) may be 
incidental. Our data suggest that high target sequence free 
energy alone may not be sufficient for efficient ribozyme 
targeting. We propose that the above correlation is a conse- 
quence of the need to have some unpaired regions to facilitate 
fast nucleation of duplex formation (51,52), combined with 
short base-paired regions and helices of low stability that 
easily open up. Fulfillment of these criteria will in many cases 
result in a low local folding potential (high free energy) for the 
target site. 

A recent study on the effect of varying RNA secondary 
structure on the efficiency of specific antisense ODNs 
concluded that target sequences located within regions 
designed to be unstructured were most effective, while targets 
within stable stem-loop structures were ineffective (13). 
Recent reports by Patzel and co-workers (12,14) described a 
theoretical approach for antisense ODN target site selection 
based on the prediction of large single-stranded stretches 
(loops) by MFold. Our data do not support a correlation of 
ribozyme activity with the length of loops, possibly because all 
our targets had shorter predicted loops than recommended in 
the above studies. Our hypothesis and the conclusions of the 
above studies are, however, not mutually exclusive. Target 
sites containing very large loops will have a good probability 
of also containing short helical regions, which we propose to 
be the limiting factor. In fact, applying our hypothesis of target 
site evaluation to three previously well characterized target 
sites (t351, t398, t498) (14,22) within mRNA for murine DNA 
methyl transferase, the same rank susceptibility of targets, in 
perfect accordance with actual inhibition data, is predicted by 
both theories. 



In addition to the structure of the target site, the composition 
of the target sequence may also be of some importance. 
Sequences with a high G+C content will hybridize more efficiently 
with the complementary arms of their ribozymes and possibly 
increase the efficacy of the ribozyme. Although a general 
correlation between hybrid stability and ribozyme efficacy was 
not supported by our data (Table 2), it is worth noting that the 
most efficient ribozyme, MRz-287, has a substantially higher 
affinity for its target sequence than the other ribozymes 
(Table 1) due to an unusually high G+C content (13 out of 
16 nt). In conclusion, the target sequence of MRz-287 represents 
the proposed desirable structural features for a good ribozyme/ 
antisense target site. The target sequence consists of alternating 
short stretches of paired and unpaired bases, which limit stems 
and helical regions to no more than 3 bp. AH other target 
sequences fold into secondary structures containing longer and 
more stable helical regions. One study has reported that 
hybridization accessibility for hammerhead ribozymes is 
correlated with the presence of unpaired bases near the 
cleavage triplet (60). Although our data do not suggest this to 
be a critical requirement for in vivo activity, the above criterium is 
also fulfiled for the guc287 target site, as the longest single- 
stranded stretch is situated around the cleavage triplet and 
includes the base preceding the scissile bond (Fig. 2). All of the 
above mentioned features of guc287 add up to a very effective 
ribozyme target site, in good agreement with its observed 
inhibitory capacity in HeLa cells. 

In conclusion, our study indicates that there is a poor corre- 
lation between the apparent in vivo accessibility of a target and 
its accessibility in a completely cell-free in vitro assay as 
performed here. Thus such assays appear to be of limited value 
even for a preliminary selection of target sites. However, 
predictions by the MFold program suggest a correlation of 
certain features of the predicted secondary structures of target 
sequences, helical stability in particular, with ribozyme 
efficacy. The generality of these findings will, however, need 
to be investigated in an alternate test system. If these correlations 
should be confirmed, this would represent a significant improve- 
ment in the preliminary selection of candidate ribozymes. 
Ultimately, howev^, an empirical cell-based assay will still need 
to be p^ormed to select the best of these candidates. 
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ABSTRACT 

Higli-throughput genomic technologies are revolu- 
tionizing modern biology. In particular, DNA micro- 
arrays have become one of the most powerful tools 
for profiling global mRNA expression in different 
tissues and environmental conditions, and for 
detecting single nucleotide polymorphisms. The 
broad applicability of gene expression profiling to 
the biological and medical realms has generated 
expanding demand for mass production of micro- 
arrays, which in turn has created considerable 
Interest In improving the cost effectiveness of 
microarray fabrication techniques. We have devel- 
oped the computational framework for an optimal 
synthesis strategy for oligonucleotide microarrays. 
The problem was introduced by Hubbell et al Here, 
we formalize the problem, obtain precise bounds on 
Its complexity and devise several computational 
solutions. 



INTRODUCTION 

Oligonucleotide and cDNA microarrays can monitor mRNA 
expression levels for tens of thousands of genes simul- 
taneously (1). While both types of arrays are applied to the 
elucidation of normal and pathological cellular mechanisms, 
the longer probes on cDNA arrays make them more suscep- 
tible to cross-hybridization, and oligo arrays are designed to 
reduce cross-hybridization and improve sensitivity (2). In 
addition, oligo microarrays can be used to detect polymorph- 
isms (3) and, therefore, can greatly facilitate the research and 
diagnosis of genetic predisposition to diseases. Several large 
companies, such as Affymetrix, Coming, Motorola and 
Samsung, have established or are establishing the capability 
of manufacturing microarrays in large quantities. Thus, 
reductions in cost or time to manufacture can have a 
significant impact on biotechnology and medicine. 

cDNA microarrays are produced by spotting pre-made 
cDNA solutions onto a glass or nylon surface via physical 



contact or ink-jet deposition. While oligonucleotides can 
also be synthesized and then spotted, oligo microarrays are 
usually manufactured by in situ synthesis, primarily via 
photolithography (4), and more recently by ink-jet deposition 
(5). In situ synthesis involves the consecutive addition of A, C, 
G and T nucleotides to the appropriate spots on the 
microarray. An important advantage of photolithographic 
synthesis over ink-jet deposition is that in a single cycle of 
synthesis, a nucleotide can be added to all desired spots on 
the array. This is achieved via photodeprotection of the 
target spots on the array surface with UV light prior to the 
addition of the nucleotide. Meanwhile, non-target spots must 
be protected from the UV light using physical or virtual 
masks. The fabrication of physical masks is a laborious and 
costly process and one mask is needed for each cycle of 
synthesis for each variety of arrays. Singh-Gasson et al (6) 
used a digital micro-mirror device to reflect light selectively 
onto the desired spots of an array, the 'virtual masking' 
strategy. Nonetheless, the deprotection step for each cycle 
lasts ~5 min and photolabile nucleosides are expensive. 
Therefore, decreasing the number of cycles required to 
synthesize a given set of sequences can reduce time and 
cost. Here, we address synthesis optimization, i.e., optimizing 
the order of nucleotide addition. 

The simplest strategy for synthesizing a given set of 
sequences is to add A bases wherever appropriate as the first 
base, then C, G and T bases, repeating this process for the 
second base, and so on. Ghee et al (3) noted that if K is the 
length of the longest oligonucleotide to be synthesized, 
maximally AK cycles are required. Hubbell et al (7) observed 
that it would be possible to skip a synthesis cycle if a base is 
not needed by any oligonucleotides, or if the oligonucleotides 
that require the base can still be synthesized when that base is 
presented again later. In a parallel publication, Tolonen et al 
(8) observe that synthesis could be accelerated, even for a 
large set of oligonucleoddes, if the order of base addition is 
tailored to the oligonucleotide sequences. Consequently, 
oligonucleotides can vary in length by more than one base at 
the end of every synthesis cycle. This observation has 
motivated the development of the optimal base addition 
strategy described in this paper. 



*To whom correspondence should be addressed at 44 Cummington Street, Boston University, Boston, MA 02215, USA. Tel: +1 617 358 1845; 
Fax: +1 617 353 6766; Email: kasif@bu.edu 

The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors 



el06 Nucleic Acids Research, 2002, Vol. 30 No. 20 



Page 2 OF 6 



A C 

c 

OTTO 
C O 6 C 

A c c A chip surface 




c c 

B A C C A C 

c 

c c 

A c c A Chip surface 



I 

6 T T G 
C O O C 
C C A C 

Figure 1. Formulation of synthesis strategy. (A) Four 3mer oligonucleotides. (B) Four partially constructed oligos, defining frontier F = [3,1,1,2]. 
(C) The synthesis strategy [A,C,G,T] can synthesize the array in four cycles, instead of six cycles required by the traditional approach. 
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A COMPUTATIONAL FRAMEWORK FOR OPTIMAL 
SYNTHESIS STRATEGY 

We formulate the question of devising an optimal synthesis 
strategy in oligo microarrays as a combinatorial state space 
search problem. This computational formulation provides 
insight into the complexity of the problem and enables a range 
of discrete optimization and heuristic search solutions. 

In this paper, we assume that the input to the optimization 
software is a collection of N oligo sequences of arbitrary 
lengdi, which have been pre-selected in a probe selection 
process. For simplicity, we discuss the case of uniform length 
ATmers, but our framework is readily applicable to the more 
general case. An optimal synthesis strategy involves L cycles 
of synthesis where, in each cycle, a single and identical 
nucleotide is added to ail unmasked oligos. The exact spatial 
location of each oligo on the array is not important, as long as 
it can be retrieved during actual synthesis. Therefore, we 
assume that the input to the optimization code is a list of oligos 
arranged in one dimension, such as shown in Figure lA. We 
define a strategy for constructing ^mers in L cycles as an L 
long vector 5, consisting of elements A, C, G and T. Slf] is the 
nucleotide added in cycle j. For instance, the vector 
[A,C,A,T,G] corresponds to using A, C, A, T and G in cycles 
1-5, respectively. We define the height of each partially 
constructed oligo as the number of nucleotides that have been 
added thus far by the synthesis strategy. We define a frontier F 
of a partially synthesized array to be an integer vector of size N 
where F[i] is the height of the ith oligo thus far. For example, 
the frontier of the four oligos in Figure IB is F = [3,1,1,2). 



The 'traditional' way to create a chip of N oligos, each of 
height K, is to perform 4K cycles of synthesis. After the 
addition of A, C, G and T to the appropriate spots of the airay, 
all oligos will be one base long. We then proceed to synthesize 
layer two in four cycles and all oligos will be two bases long. 
We continue until tfie entire chip is synthesized in 4/^ cycles. It 
is easy to observe that by a slight modification of the order of 
base addition (8), we can expedite the above process. As a 
simplified example (Fig. IC), we can synthesize an array of K 
= 3 in four cycles using a modified synthesis strategy, 
compared to six cycles with the 'traditional' approach. 

In order to introduce the optimization framework for 
masking, we need to measure the *work* that has been 
accomplished after several cycles of synthesis. We therefore 
give two definitions of 'frontier height': (i) the min height of a 
frontier constructed after L cycles is the length of the shortest 
oligo [in Fig. IB, min height (L) = 1]; (ii) the sum height of a 
frontier is the sum of the lengths of all oligos constructed so far 
[in Fig. IB, sum height (L) = 3 + 1 + 1 + 2 = 7]. 

The objective optimization criterion we desire to minimize 
is the number of cycles required to create a frontier of min 
height = Ky i.e., we seek the shortest length strategy vector that 
is sufficient to synthesize all oligos on the chip. It is obvious 
that the best possible strategy for a Kmcr oligo chip is of 
length between K and 4K. It is easy to construct an example 
where the shortest strategy is of length 4K (Fig. 2A), although 
genomic sequences typically do not exhibit such extremely 
low complexity. In general, as the number of oligos on a chip 
grows, the length of the optimal strategy vector is expected to 
grow as well. 
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Figure 2. Example arrays that challenge synthesis strategies. (A) A worst-case scenario requiring 4K stages for Kmcr synthesis. (B) The optimal solution for 
this chip requires 9 cycles whereas the greedy solution produces a 12-cycle strategy. However, the sum height-based greedy solution produces the optimal 
synthesis strategy. (C) For this example, sum height heuristics create an 8-cycle strategy: AATAATAA. The exhaustive search produces a 7-cycle strategy: 
ATAATAA. 



Local Search Algorithm: 
Input: A Strategy of Length M 

Until No Mo/B Improvement is Obsen/ed Iterate the Following Steps: 

1, For each position in I the current strategy 0< l<M 

a. Try every possible NT in position I 

b. Compute the length of the resulting strategy 

2. Accept the strategy if the strategy Is best over 4M perturbations (strategies) tried. 
Figure 3. A local search algorithm. 



HEURISTIC SEARCH SOLUTIONS 

An obvious heuristic solution to devising the optimal synthesis 
strategy is a greedy search. In each synthesis cycle, we 
consider the four different options to extend the current layout 
and compute the height of the resulting frontier for each 
option. We choose the nucleotide that maximizes the height of 
the frontier, which could be either min height or sum height. 
The min height heuristic extends the shortest oligo, while the 
sum height heuristic chooses the nucleotide that will add the 
largest number of nucleotides to the chip. 

The simple examples in Figure 2B and C show that a 
greedy search is not guaranteed to produce an optimal 
solution using either the min height or the sum height 
heuristic. In the remainder of this section, we consider ways 
to improve the greedy search. In the next section, we show 
that an efficient polynomial time solution is unlikely to exist 
for this optimization problem. We then report simulation 
results that describe die effectiveness of the sum height 
heuristics. 

Look-ahead solution 

A natural way to extend the greedy algorithm is to consider a 
look-ahead strategy, (i) Generate all possible frontiers that can 
be generated in L cycles. The number of strategies is 4^. The 
number of frontiers might be smaller since different strategies 
may generate the same frontier, (ii) For each frontier, compute 
the height, (iii) Choose (for the first cycle) the strategy that 
maximizes the height after L cycles, (iv) Repeat until all ohgos 
have been synthesized. 

When L = 4Ky this algorithm performs an exhaustive search. 
A rough upper bound on the running time of this algorithm is 
O (4^A0, which makes it prohibitive for large values of L An 
alternative approach would be to use a variant of best-first 
search such as A*, a popular algorithm in the artificial 



intelligence community. A more space-efficient alternative 
would be to use a branch-and-bound formulation, a standard 
approach in discrete optimization. 

Local search solution 

A local search attempts to improve a given solution by a series 
of local perturbations until a minimum is achieved. One 
obvious local search approach for our problem would be to 
repeatedly change a selected nucleotide in the strategy vector 
and accept the new strategy if it results in an improvement 
over the previous one. Here, we implement a variant (Fig. 3) 
based on steepest descent. 

The steepest descent algorithm considers every possible 
local perturbation of a single nucleotide in all positions of the 
strategy vector and chooses the move that results in the 
greatest improvement. This algorithm terminates relatively 
quickly since there are at most M possible improvements to be 
made. Each iteration (steps 1-2) requires time 0(A/), so the 
total running time is 0(A/^). 

A simple classic variant of this algorithm is to accept a new 
strategy with some probability even if no improvement is 
observed. A Gibbs sampler is a special case of this solution 
when the probability of acceptance is a function of the degree 
of improvement, and positions for possible perturbations are 
selected at random rather than sequentially as described above. 
We describe simulation results using local search below. 



COMPUTATIONAL ANALYSIS OF OPTIMAL 
SYNTHESIS STRATEGY 

In this section, we provide a set of computational reductions 
that allow us to obtain a precise characterization of the 
complexity of the optimal synthesis strategy. We see this part 
as the main contribution of this paper. 
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Multiple sequence alignment formulation 

We first observe that the optimal masking problem can be 
reduced to a special case of multiple sequence alignment. A 
precise description of multiple sequence alignment can be 
found in Gusfield (9) and Waterman (10). In particular, the 
best L cycle synthesis strategy directly corresponds to the 
optimal multiple alignment of the oligos, where the costs of 
the alignment are defined as follows: (i) replacement cost = 
+00; (ii) deletion cost = +oo; (iii) insertion cost = +1 . That is, the 
only allowed 'editing* operation is the insertion of a gap. We 
first demonstrate this principle with an example. For the 
oligo design problem in Figure 2B, we first align the two 
oligos CCCAAA and AAACCC. An optimal alignment is 
given by 

CCCAAA 
AAACCC 

Walking across the alignment from left to right creates the 
following synthesis strategy: [CCCAAACCC]. Another 
optimal strategy is [AAACCCAAAl. 

The formal proof of this equivalence is not difficult. Each 
strategy corresponds to a multiple alignment obtained by 
aligning each sequence against the strategy sequence. 
Therefore, the shortest strategy corresponds to the shortest 
global alignment. 

This observation enables the application of computational 
solutions developed in multiple sequence analysis such as 
dynamic programming, Gibbs sampling and iterative refine- 
ment (9,10). A common greedy solution is based on aligning 
each pair, then producing a strategy based on the best aligned 
pair and subsequently continuing to add oligos to the 
alignment in the best-first manner. 

Shortest super-sequence formulation 

The above observation is useful for obtaining insight into the 
problem. By reducing our problem to a special case of 
multiple sequence alignment, we show that multiple align- 
ment is 'harder*, which does not preclude the possibility of an 
efficient solution to our specific problem. Here we show that 
the optimal synthesis strategy problem is exactly equivalent to 
the problem of computing the shortest super-sequence of a 
collection of strings. This two-way reduction establishes our 
problem to be as hard as the shortest super-sequence problem, 
which is known to be NP hard. 

We informally define sequence X to be a super-sequence of 
sequence Y if every character of Y occurs in X in the same 
order as they occur in Y, Similarly, we define a super-sequence 
of a collection of sequences where the above condition has 
to hold for each of the sequences. For instance, 
AAAACCCTTTT is a super-sequence of ACT, AAACCCT 
and AAAATTT. Sequence X is the shortest super-sequence of 
a collection of sequences if and only if its length is the shortest 
among all super-sequences of the collection. 

Since the synthesis strategy must be a super-sequence of 
each of the oligos, the optimal synthesis strategy vector is 
equivalent to the shortest super-sequence of all oligos. The 
shortest super-sequence problem is known to be NP hard (11) 
and therefore the reduction above formally establishes the 
optunal synthesis strategy problem to be NP hard. More 



explicitly, the problem of finding a masking strategy of length 
L (L < 4/0, given a collection of N oligos of length K, is NP 
complete. 

This is an important observation since it implies that the 
optimal synthesis strategy is unlikely to have efficient (sub- 
exponential) optimal solutions for a large number of oligos. It 
is easy, of course, to devise relatively efficient dynamic 
programming solutions when the number of oligos is constant 
(e.g. less than 10). 



SIMULATIONS WITH RANDOM OLIGOS 

We have conducted a large number of simulations to estimate 
the performance of several heuristic approaches to devising an 
optimal synthesis strategy. Here we report our results with 
three heuristic approaches, (i) Oblivious strategy: we simply 
repeat synthesizing ACGTACGT. . . independent of the input 
sequences, (ii) Max sum height heuristics: we choose the 
nucleotide that maximizes the sum height in the next cycle (as 
outlined above), (iii) Randomized local search improvement: 
once a solution is obtained by the above two methods, we 
attempt to improve the solution using local search. 

Our results comparing the oblivious and max sum height 
heuristics are summarized in Table 1. It is clear that for 
random oligos there is no significant difference in perfor- 
mance between max height and oblivious heuristics. It is not 
particularly surprising for random oligos since, roughly 
speaking, every layer in the chip contains an approximately 
equal number of nucleotides of each type (A, C, G and T). 
Moreover, our results for 'real* oligos appear to be consistent 
with this performance (data not shown). Note that one of the 
criteria for selecting oligos aims to prevent cross-hybridiza- 
tion between mRNA and multiple oligos. This puts 'selective 
pressure* on the design to ensure that oligos are as different as 
possible. As the number of oligos on the chips grows, they 
behave more and more like random oligos. 

We have interpolated the expected length of a strategy for 
10 000 oligos and it appears to fit the following function well: 

f{K) = 2.5K = 4My/K 



where K is the height of the oligos. 

Figure 4 shows an essentially linear fit of the data. The 
graph was produced by fitting the functiony(X) = 2.5K-¥ Cy/K, 
where C is the single adjustable parameter. As a result, the fit 
is linear. The formal derivation that proves this expectation for 
max sum height is implied by the analysis in Jiang and Li (11). 

Now we provide a brief motivation for the above inter- 
polating function. When we find the shortest alignment of a 
single random oligo XiX2.^,XK with ACGTACGT..,, Xi 
aligns with the first, second, third or fourth base, X2 aligns with 
the first, second, third or fourth base after Xj, X3 aligns with 
the first, second, third or fourth base after X2, etc. Thus the 
expected distance between Xi and Xi + i is (1 + 2 + 3 + 4)/4 = 
2.5. For example, if Xi = C, then the distance between Xi and 
Xi ^. , is 1 if + 1 = T, 2 if Xi + , = G, 3 if Xj + 1 = A or 4 if 
X| + 1 = C. The variance of the above possible distances is 1,25. 
Therefore, we expect X1X2...XK to require an alignment of 
length 2.5A' and, by the law of large numbers, a random oligo 
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Table 1. Comparison of the oblivious strategy with the max sum height heuristic 


K 


N 


Max sum 
height 


Standard 
deviation 


ACGT 


Standard 
deviation 


ADV 


10 


10 000 


37.4 


0.70 


37.7 


0.95 


-3 




20 000 


37.9 


0.74 


38.2 


1.14 


-3 




30 000 


37.7 


0.67 


37.9 


0.32 


-2 




40 000 


38.0 


0.67 


38.1 


0.57 


-1 


20 


10 000 


68.1 


0.57 


67.7 


1.25 


4 




20 000 


69,0 


0.67 


68.7 


0.67 


3 




30 000 


69.7 


0.67 


70.7 


0.82 


-10 




40 000 


69.7 


1.06 


70.1 


1.20 


-4 


40 


10 000 


125.6 


1.26 


127.1 


2.47 


-15 




20 000 


126.9 


1.20 


127.8 


1.32 


-9 




30 000 


127^ 


1.08 


129.5 


2.92 


-20 




40 000 


128.2 


1.14 


129.5 


2.51 


-13 


60 


10 000 


181,3 


1.49 


183.2 


2.94 


-19 




20 000 


183.3 


1.42 


184.0 


1 35 


— / 




30 000 


183.8 


U69 


184.3 


1.77 


-5 




40 000 


184.8 


1.81 


185.4 


2.72 


-6 


80 


10 000 


235.1 


1.66 


238.7 


2.50 


-36 




20 000 


237.8 


1.87 


240.9 


3.63 


-31 




30 000 


238.2 


1.55 


240.1 


3.21 


-19 




40 000 


240.3 


1.89 


242.7 


3.16 


-24 


100 


10 000 


290.6 


2.01 


291.8 


2.10 


-12 




20 000 


292.6 


2.17 


294 J 


2.33 


-15 




30 000 


293.3 


1.49 


295.8 


2.57 


-25 




40 000 


294.6 


1.17 


296.6 


2.27 


-20 



For each approach, we list the length of a strategy averaged over 10 experiments and the standard deviation. 
We also give the cumulative savings in nucleotides over 10 experiments (the ADV column). Note that the 
standard deviation is higher for the oblivious strategy (ACGT). K and N are the length and number of oligos 
on the microarray, respectively. 



300 



5^150 




Figure 4. A linear fit of the function /(A) = 2.5K + 4.04\/i^ to the simulation data produced by the oblivious strategy for 1000 oligos. The scaling constant 
(4.04) depends on the total number of oligos. 
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Table 2. Results of local search improvement (LS) for both max sum 
height (MSH) and oblivious solutions (ACGT) 



K 


N 


MSH 


MSH-LS 


ACGT 


ACGT-LS 


10 


100 


33.1 


32.6 


34.6 


32.5 




1000 


35.9 


35.7 


36.1 


35.7 


20 


100 


62.3 


61.1 


63.3 


60.6 




1000 


65.4 


65.3 


65.9 


64.7 


40 


100 


115.4 


114.6 


117.3 


114.2 



K and N are the length and number of oligos on the microarray, 
respectively. 



requires an alignment of length at most 2,5K + 
OiVK) with high probability. In order to align N oligos, 
length 2.5A:+ 0(logN\/^) suffices with high probability. For a 
fixed number of oligos, the logarithm term is constant. 
Therefore, we can use the formula 25K + 0{y/K), 

We also used a simple local search to improve the solutions 
produced by the max sum height and oblivious strategies. The 
results are given in Table 2. We found that when the number of 
oligos is small (e.g. 100), the improvement was better 
(-3-5%). However, as the number of oligos grows (e.g. 
1000), the percentage of improvement was reduced to 
-1-1.5%. This is interesting, since Gibbs sampling is a close 
relative of local search and is a popular algorithm for multiple 
sequence alignment. However, in typical multiple alignments 
of proteins, we often align tens to hundreds of sequences. In 
this paper, we need to 'align' thousands to hundreds of 
thousands of oligos and it appears to have an impact on the 
degree of improvement obtained with this approach. 

CONCLUSIONS 

In this paper, we have presented a computational formaliza- 
tion of the optimal synthesis strategy for oligonucleotide 
microarrays. We have shown that the problem is computa- 
tionally intractable (NP complete). We have provided several 
simulation results that shed light on its practical complexity. 
As the number of applications of oligo microarrays increases 
and their use in diagnostic medical applications becomes a 
common practice, we expect the design of DNA chips to 
become more sophisticated and efficient. Our main conclusion 
from both the theoretical and simulation analyses provided in 
this paper is that the problem of optimal masking appears to be 
computationally difficult. Moreover, the simplest possible 



solution appears to work almost as well as more sophisticated 
approaches that include heuristic greedy approaches and local 
search. It would be interesting to see if more exhaustive 
approaches based on best-first search, branch-and-bound or 
Gibbs sampling methods will generate a more dramatic 
improvement in performance. Naturally, these results must be 
confirmed in the context of practically used DNA chips. 
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15 Background of the Invention 

Antisense and double-stranded RNA oligonucleotides are promising therapeutic 
agents and useful research tools for elucidating gene function. However, it is often difficult 
to achieve efficient inhibition of protein synthesis using such compositions. 

In order to maximize their therapeutic activity, it would be of great benefit to improve 
20 upon the prior art antisense and double-stranded RNA oligonucleotides by enhancing the 
efficiency with which they inhibit protein synthesis. 



Snmmary of the Invention 

The instant invention is based, at least in part, on the discovery of antisense and 
25 double-stranded oligonucleotide compositions that provide improved inhibition of gene 

expression. In particular, the oligonucleotide compositions of the present invention make use 
of combinations of antisense or double-stranded oligonucleotides. 

In one aspect, the invention pertains to an oligonucleotide composition comprising at 
least 3 different oligonucleotides targeted to at least three different nucleotide sequences 
30 within a target gene, wherein (i) the oligonucleotides bind to their target nucleotide sequence 
with high affinity and (ii) the oligonucleotides are GC enriched. 

In one embodiment, the oligonucleotides are antisense oligonucleotides. 
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In another embodiment, the oligonucleotides are double-stranded RNA 
oligonucleotides. 

In one embodiment, the oligonucleotide compositions bind to their target nucleotide 
sequence with a Tm of at least about 60°C. 
5 In one embodiment, the oligonucleotides have a GC content of at least about 20%, 

In one embodiment, the composition comprises at least about 4 antisense 
oligonucleotides targeting at least four different nucleic acid sequences. In another 
embodiment, tlie composition comprises at least about 5 oligonucleotides targeting at least 
five different nucleic acid sequences. In still another embodiment, the composition comprises 
10 at least about 6 oligonucleotides targeting at least six different nucleic acid sequences. 

In one embodiment, the oligonucleotides are at least about 25 nucleomonomers in 
length. In another embodiment, the oligonucleotides are greater than about 25 
nucleomonomers in length. 

In one embodiment, at least one of the antisense oligonucleotides is complementary in 
1 5 sequence to its target nucleotide sequence. In another embodiment, the antisense 
oligonucleotides activate RNase H. 

In one embodiment, at least one of the oligonucleotides comprise at least one 
modified intemucleoside linkage. 

In another embodiment, at least one of the oUgonucleotides comprise at least one 
20 modified sugar moiety. 

In one embodiment, the composition further comprises a pharmaceutically acceptable 

carrier. 

In one embodiment, the oligonucleotide composition achieves a level of inhibition of 
protein synthesis the same as or higher than the level of inhibition achieved by the most 
25 effective individual oligonucleotide of the composition. 

In one embodiment, the mdividual oligonucleotides are not separately tested for their 
ability to inhibit protein synthesis prior to their incorporation into the composition. In this 
respect, the present invention represents a substantial and unrecognized improvement over the 
state of the art. 

30 In one embodiment, the oligonucleotide composition results in greater than about 80% 

inhibition of protein synthesis. 
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In another aspect, the invention pertains to a method of inhibiting protein synthesis in 
a cell comprising contacting the cell with at least 3 different oligonucleotides targeted to at 
least three different nucleotide sequences within a target gene, wherein (i) the 
oligonucleotides bmd to their target nucleotide sequence with high affinity and (ii) the 
5 oligonucleotides are GC enriched, to thereby inhibit protein synthesis. 

In one embodiment, the oligonucleotides are antisense oligonucleotides. In another 
embodhnent, the oligonucleotides are double-stranded RNA oligonucleotides. 

In one embodiment, the method is performed in a high-throughput format. 

In still another aspect, the invention pertains to a method of identifying function of a 
10 gene encoding a protein comprising: contacting the cell with at least 3 different 

oligonucleotides targeted to at least three different nucleotide sequences within a target gene, 
wherein (i) the oligonucleotides bind to their target nucleotide sequence with high affinity 
and (ii) the oligonucleotides are GC enriched, and assaying for a change in a detectable 
phenotype in the cell resulting from the inhibition of protein expression, to thereby determine 
1 5 the function of a gene. 

The relative amounts of these different oligonucleotides may optionally be different. 
That is, the three or more different oligonucleotides may be present in equimolar 
concentrations, or non-equimolar concentrations. 

In one embodiment, the oligonucleotides are antisense oligonucleotides. In another 
20 embodiment, the oligonucleotides are double-stranded RNA oligonucleotides. 

In one embodiment, the method is performed in a high-throughput format. 

In another aspect, the invention pertains to a method of making the oligonucleotide 
composition, comprising: combining at least 3 different oligonucleotides targeted to at least 
three different nucleotide sequences within a target gene, wherein (i) the oligonucleotides 
25 bind to their target nucleotide sequence with high affinity and (ii) the oligonucleotides are GC 
enriched, and wherein the individual oligonucleotides are not separately tested for their 
ability to inhibit protein synthesis prior to their incorporation into the composition. 

In one embodiment, the oligonucleotides are antisense oligonucleotides. In another 
embodiment, the oligonucleotides are double-stranded RNA oligonucleotides. 

30 In another aspect, the mvention pertains to an oligonucleotide composition comprising 

at least 3 different double-stranded RNA oligonucleotides targeted to at least three different 
nucleotide sequences within a target gene. 
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In Still another aspect, the invention pertains to a method of inhibiting protein 
synthesis in a cell comprising contacting the cell (or cell lysate) with at least 3 different 
double-stranded RNA oligonucleotides targeted to at least three different nucleotide 
sequences within a target gene. 

5 In yet another aspect, the invention pertains to a method of identifying function of a 

gene encoding a protein comprising: contacting the cell with at least 3 different double- 
stranded RNA oligonucleotides targeted to at least three different nucleotide sequences within 
a target gene and assaying for a change in a detectable phenotype in the cell resulting from 
the inhibition of protein expression, to thereby determine the function of a gene. 

10 In another aspect, the invention pertains to a method of making an oligonucleotide 

composition comprising combining at least 3 different double-stranded RNA oligonucleotides 
targeted to at least three different nucleotide sequences within a target gene wherein, the 
individual oligonucleotides are not separately tested for their ability to inhibit protein 
synthesis prior to their incorporation into the composition. 

15 

Drawings 

Figure 1 shows a summary of the results of about 30 antisense inhibition experiments 
against about thirty different genes in cell culture. Oligonucleotide compositions comprising 
mixtures of oligonucleotides (with the worst 10% of target genes removed) are compared 
20 with the best individual oligonucleotides and data for all individual oligonucleotides in the 
percent inhibition observed. 

Figure 2 shows ultramer data for a mixture of siRNA complexes targeting p53. 

Figure 3 shows ultramer data for a mixture of siRNA complexes targeting GTP20. 

Figure 4 shows ultramer data for a mixture of siRNA complexes targeting Cbfa-1 . 

25 Figure 5 shows ultramer data for a mixture of siRNA complexes targeting FTP mu. 

Figure 6 shows ultramer data for a mixture of siRNA complexes targeting FTP-PEST. 

Figure 7 shows ultramer data for a mixture of siRNA complexes targeting FTP eta. 
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Detailed Description of the Ipventioii 

Although inhibition of protein synthesis could be achieved with certain antisense and 
double-stranded RNA oligonucleotides of the prior art, multiple transfections were requned 
to identify effective oligonucleotides. The instant invention advances the prior art, inter alia, 
5 by providing oligonucleotide compositions that enhance the efficiency with which protein 
synthesis is inhibited and methods of making and using these improved oligonucleotide 
compositions. 

Methods of stabilizing oligonucleotides, particularly antisense oligonucleotides, by 
formation of a duplex with a complementary oligonucleotide, are disclosed in co-pending 

10 application no. U.S. , filed on the same day as the present application, bearing 

attorney docket number "SRI-020 " and entitled "Double-Stranded Oligonucleotides." This 
application and all of its teachings is hereby expressly incorporated herein by reference in its 
entirety. 

15 Antisense and Double-stranded RNA Oligonucleotide Compositions 

Antisense or double-stranded RNA oligonucleotides for incorporation into 
compositions of the invention inhibit the synthesis of a target protein, which is encoded by a 
target gene. The target gene can be endogenous or exogenous {e.g., introduced into a cell by 
a virus or using recombinant DNA technology) to a cell. As used herein, the tenn "target 

20 gene" includes polynucleotides comprising a region that encodes a polypeptide or 

polynucleotide region that regulates replication, transcription, translation, or other process 
important in expression of the target protein or a polynucleotide comprising a region that 
encodes the target polypeptide and a region that regulates expression of the target 
polypeptide. Accordingly, the term "target gene" as used herein may refer to, for example, an 

25 mRNA molecule produced by transcription a gene of interest. Furthermore, the term 

"correspond," as in "an oligomer corresponds to a target gene sequence," means that the two 
sequences are complementary or homologous or bear such other biologically rational 
relationship to each other (e.g., based on the sequence of nucleomonomers and their base- 
pairing properties). 

30 The ^target gene" to which an RNA molecule of the invention is directed may be 

associated with a pathological condition. For example, the gene may be a pathogen- 
associated gene, e.g., a viral gene, a tumor-associated gene, or an autoimmxme disease- 
associated gene. The target gene may also be a heterologous gene expressed in a recombinant 
cell or a genetically altered organism. By determining or modulating {e.g., inhibiting) the 

35 function of such a gene, valuable information and therapeutic benefits in medicine, veterinary 
medicine, and biology may be obtained. 
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The term "antisense" refers to a nucleotide sequence that is inverted relative to its 
normal orientation for transcription and so expresses an RNA transcript that is 
complementary to a target gene mRNA molecule expressed within the host cell (e.g., it can 
hybridize to the target gene mRNA molecule through Watson-Crick base pairing). An 

5 antisense strand may be constructed in a number of different ways, provided that it is capable 
of interfering with the expression of a target gene. For example, the antisense strand can be 
constructed by inverting the coding region (or a portion thereof) of the target gene relative to 
its normal orientation for transcription to allow the transcription of its complement, (e.g., 
RNAs encoded by the antisense and sense gene may be complementary). Furthermore, the 

10 antisense oligonucleotide strand need not have the same intron or exon pattern as the target 
gene, and noncoding segments of the target gene may be equally effective in achieving 
antisense suppression of target gene expression as coding segments. 

The term "oligonucleotide" includes two or more nucleomonomers covalently 
coupled to each other by linkages or substitute Imkages. An oligonucleotide may comprise, 

15 for example, between a few (e.g.,7, 10, 12, 15) or a few hundred ( e.g, 100, 200, 300, or 400) 
nucleomonomers. For example, an oligonucleotide of the invention preferably comprises 
between about 10 and about 50 nucleomonomers , between about 15 and about 40, or 
between about 20 and about 30 nucleomonomers. In one embodhnent, an oligonucleotide 
comprises about 25 nucleomonomers. In another embodiment, an oligonucleotide comprises 

20 greater than about 25 nucleomonomers. 

Oligonucleotides may comprise, for example, oligonucleotides, oligonucleosides, 
polydeoxyribonucleotides (containing 2'-deoxy-D-ribose) or modified forms thereof, e.g., 
DNA, polyribonucleotides (containing D-ribose or modified forms or analogs thereof), RNA, 
or any other type of polynucleotide which is an N-glycoside or C-glycoside of a purine or 

25 pyrimidine base, or modified purine or pyrimidine base. The term oligonucleotide includes 
compositions in which adjacent nucleomonomers are linked via phosphorothioate, amide or 
other linkages {e.g., Neilsen, P.E., et al 1991. Science. 254:1497). Generally, the term 
"linkage" refers to any physical connection, preferably covalent coupling, between two or 
more nucleic acid components, e.g., catalyzed by an enzyme such as a ligase. 

30 The term "oligonucleotide" includes any structure that serves as a scaffold or support 

for the bases of the oligonucleotide, where the scaffold permits binding to the target nucleic 
acid molecule in a sequence-dependent maimer. 

An "overhang" is a relatively short single-stranded nucleotide sequence on the 5'- or 
3' -hydroxy 1 end of a double-stranded oligonucleotide molecule (also referred to as an 
35 "extension," "protruding end," or "sticky end"). 
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Oligonucleotides of the invention are isolated. The term "isolated" includes nucleic 
acid molecules which are synthesized {e.g., chemically, enzymatically, or recombinantly) or 
are naturally occurring but separated from other nucleic acid molecules which are present in a 
natural source of the nucleic acid. Preferably, a naturally occurring "isolated" nucleic acid 
5 molecule is free of sequences which naturally flank the nucleic acid molecule (ie. , sequences 
located at the 5' and 3' ends of the nucleic acid molecule) in a nucleic acid molecule in an 
organism from which the nucleic acid molecule is derived. 

The term "nucleomonomer" includes bases covalently linked to a second moiety. 
Nucleomonomers include, for example, nucleosides and nucleotides. Nucleomonomers can 
10 be linked to form oligonucleotides that bind to target nucleic acid sequences in a sequence 
specific manner. The term "second moiety" as used herein includes substituted and 
unsubstituted cycloalkyl moieties, e.g., cyclohexyl or cyclopentyl moieties, and substituted 
and unsubstituted heterocyclic moieties, e.g., 6-member morpholino moieties or, preferably, 
sugar moieties. 

1 5 Sugar moieties include natural, unmodified sugars, e.g. , monosaccharides (such as 

pentoses, e.g., ribose), modified sugars and sugar analogs. Possible modifications of 
nucleomonomers include, for example, replacement of one or more of the hydroxyl groups 
with a halogen, a heteroatom, an aliphatic group, or the fimctionalization of the group as an 
ether, an amine, a thiol, or the like. For example, modified sugars include D-ribose, 2'-0- 

20 alkyl (including 2'-0-methyl and 2'-0-ethyl), /.e.,2'-alkoxy, 2'-amino, 2'-S-alkyl, 2'-halo 
(including 2'-fluoro), 2'-methoxyethoxy, 2'-allyloxy (-OCH2CH=CH2), 2'-propargyl, 2'- 
propyl, ethynyl, ethenyl, propenyl, and cyano and the like. In one embodiment, the sugar 
moiety can be a hexose and incorporated into an oligonucleotide as described (Augustyns, K., 
etal,Nticl Acids. Res. 1992. 18:4711). Exemplary nucleomonomers can be found, e.g., in 

25 U.S. Patent 5,849,902. 

As used herem, the term "nucleotide" includes any monomeric unit of DNA or RNA 
containing a sugar moiety (pentose), a phosphate, and a nitrogenous heterocyclic base. The 
base is usually linked to the sugar moiety via the glycosidic carbon (at the 1 ' carbon of 
pentose) and that combination of base and sugar is called a "nucleoside." The base 

30 characterizes the nucleotide with the four customary bases of DNA being adenine (A), 

guanine (G), cytosine (C) and thymine (T). Inosine (I) is an example of a synthetic base that 
can be used to substitute for any of the four, naturally-occurring bases (A, C, G or T). The 
four RNA bases are A, G, C, and uracil (U). Accordingly, an oligonucleotide may be a 
nucleotide sequence comprising a linear array of nucleotides connected by phosphodiester 

35 bonds between the 3' and 5' carbons of adjacent pentoses. Other modified 

nucleosides/nucleotides are described herein and may also be used in the oligonucleotides of 
the invention. 
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One particularly usefiil group of modified nucleomonomers are 2'-0-methyl 
nucleotides, especially when the 2'-0-methyl nucleotides are used as nucleomonomers in the 
ends of the oligomers. Such 2*0-methyl nucleotides may be referred to as "methylated " and 
the corresponding nucleotides may be made from unmethylated nucleotides followed by 
5 alkylation or directly from me%lated nucleotide reagents. Modified nucleomonomers may 
be used in combination with unmodified nucleomonomers. For example, an oligonucleotide 
of the invention may contain both methylated and unmethylated nucleomonomers. 

Some exemplary modified nucleomonomers include sugar-or backbone-modified 
ribonucleotides. Modified ribonucleotides may contain a noimaturally occurring base 

10 (instead of a naturally occurring base) such as uridines or cytidines modified at the 5-position, 
e.g., 5-(2-aniino)propyl uridine and 5-bromo uridine; adenosines and guanosines modified at 
tlie 8-position, e.g., 8-bromo guanosine; deaza nucleotides, e.g., 7-deaza-adenosine; andN- 
alkylated nucleotides, e.g., N6-methyl adenosine. Also, sugar-modified ribonucleotides may 
have the 2'-0H group replaced by a H, alxoxy (or OR), R or alkyl, halogen, SH, SR, amino 

1 5 (such as NH2, NHR, NR2), or CN group, wherem R is lower alkyl, alkenyl, or alkynyl. 

Modified ribonucleotides may also have the phosphoester group connecting to 
adjacent ribonucleotides replaced by a modified group, e.g., of phosphothioate group. More 
generally, the various nucleotide modifications may be combined. 

The term "alkyl" includes saturated aliphatic groups, mcluding straight-chain alkyl 

20 groups (e.g. , methyl, ethyl, propyl, butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, etc), 
branched-chain alkyl groups (isopropyl, tert-butyl, isobutyl, etc.), cycloalkyl (alicyclic) 
groups (cyclopropyl, cyclopentyl, cyclohexyl, cycloheptyl, cyclooctyl), alkyl substituted 
cycloalkyl groups, and cycloalkyl substituted alkyl groups. In certain embodiments, a 
straight chain or branched chain alkyl has 6 or fewer carbon atoms in its backbone (e.g., C,- 

25 Cg for straight chain, C3-C6 for branched chain), and more preferably 4 or fewer. Likewise, 
preferred cycloalkyls have from 3-8 carbon atoms in their ring structure, and more preferably 
have 5 or 6 carbons in the ring structure. The term Ci-C^ includes alkyl groups containing 1 
to 6 carbon atoms. 
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Moreover, unless otherwise specilBed, the term alkyl includes both "unsubstituted 
alkyls" and "substituted alkyls " the latter of which refers to alkyl moieties having 
substituents replacing a hydrogen on one or more carbons of the hydrocarbon backbone. 
Such substituents can include, for example, alkenyl, alkynyl, halogen, hydroxy!, 

5 alkylcarbonyloxy, arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, 
alkylcarbonyl, arylcarbonyl, alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, 
dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, carbamoyl 

10 and ureido), amidino, imino, sulfliydryl, alkylthio, arylthio, thiocarboxylate, sulfates, 
alkylsulfmyl, sulfonato, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, 
heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moiety. Cycloalkyls can be fiirther 
substituted, e.g., with the substituents described above. An "alkylaryl" or an "arylalkyl" 
moiety is an alkyl substituted with an aryl (e.g., phenylmethyl (benzyl)). The term "alkyl" 

15 also includes the side chains of natural and unnatural amino acids. The term "n-alkyl" means 
a straight chain (i.e., imbranched) unsubstituted alkyl group. 

The term "alkenyl" includes unsaturated aliphatic groups analogous in length and 
possible substitution to the alkyls described above, but that contain at least one double bond. 
For example, the term "alkenyl" includes straight-chain alkenyl groups (e.g., ethylenyl, 

20 propenyl, butenyl, pentenyl, hexenyl, heptenyl, octenyl, nonenyl, decenyl, etc.), branched- 
chain alkenyl groups, cycloalkenyl (alicyclic) groups (cyclopropenyl, cyclopentenyl, 
cyclohexenyl, cycloheptenyl, cyclooctenyl), alkyl or alkenyl substituted cycloalkenyl groups, 
and cycloalkyl or cycloalkenyl substituted alkenyl groups. In certain embodiments, a straight 
chain or branched chain alkenyl group has 6 or fewer carbon atoms in its backbone (e.g., Cj- 

25 Q for straight chain, C3-Q for branched chain). Likewise, cycloalkenyl groups may have 
from 3-8 carbon atoms in their ring structure, and more preferably have 5 or 6 carbons in the 
ring stmcture. The term Cj-C^ includes alkenyl groups containing 2 to 6 carbon atoms. 
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Moreover, unless otherwise specified, the tenn alkenyl includes botli "unsubstituted 
alkenyls" and "substituted alkenyls," the latter of which refers to alkenyl moieties having 
substituents replacing a hydrogen on one or more carbons of the hydrocarbon backbone. 
Such substituents can include, for example, alkyl groups, alkynyl groups, halogens, hydroxyl, 

5 alkylcarbonyloxy, arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, 
alkylcarbonyl, aryicarbonyl, alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, 
dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, carbamoyl 

1 0 and ureido), amidino, imino, sulfhydryl, alkylthio, arylthio, thiocarboxylate, sulfates, 
alkylsulfmyl, sulfonato, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, 
heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moiety. 

The term "alkynyl" includes unsaturated aliphatic groups analogous in length and 
possible substitution to the alkyls described above, but which contain at least one triple bond. 

1 5 For example, the term "alkynyl" includes straight-chain alkynyl groups (e.g. , ethynyl, 

propynyl, butynyl, pentynyl, hexynyl, heptynyl, octynyl, nonynyl, decynyl, etc.), branched- 
chain alkynyl groups, and cycloalkyl or cycloalkenyl substituted alkynyl groups. In certain 
embodiments, a straight chain or branched chain alkynyl group has 6 or fewer carbon atoms 
in its backbone (e.g., C2-C5 for straight chain, C^-C^ for branched chain). The term Cj-Cg 

20 includes alkynyl groups containing 2 to 6 carbon atoms. 

Moreover, unless otherwise specijHed, the term alkynyl includes both "unsubstituted 
alkynyls" and "substituted alkynyls," the latter of which refers to alkynyl moieties having 
substituents replacing a hydrogen on one or more carbons of the hydrocarbon backbone. 
Such substituents can include, for example, alkyl groups, alkynyl groups, halogens, hydroxyl, 

25 alkylcarbonyloxy, arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, 
alkylcarbonyl, aryicarbonyl, alkoxycarbonyl, aminocarbonyl, alkylammocarbonyl, 
dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, carbamoyl 

30 and ureido), amidino, imino, sxilfhydryl, alkylthio, arylthio, thiocarboxylate, sulfates, 
alkylsulfinyl, sulfonato, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, 
heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moiety. 

Unless the number of carbons is otherwise specified, "lower alkyl" as used herein 
means an alkyl group, as defined above, but having firom one to five carbon atoms in its 
35 backbone structure. "Lower alkenyl" and "lower alkynyl" have chain lengths of, for example, 
2-5 carbon atoms. 
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The term "alkoxy" includes substituted and unsubstituted alkyl, alkenyl, and alkynyl 
groups covalently linked to an oxygen atom. Examples of alkoxy groups include methoxy, 
ethoxy, isopropyloxy, propoxy, butoxy, and pentoxy groups. Examples of substituted alkoxy 
groups include halogenated alkoxy groups. The alkoxy groups can be substituted with groups 

5 such as alkenyl, alkynyl, halogen, hydroxyl, allcylcarbonyloxy, arylcarbonyloxy, 
alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, arylcarbonyl, 
alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, dialkylaminocarbonyl, 
alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, cyano, amino (including 
alkyl amino, dialkylamino, arylamino, diarylamino, and alkylarylamino), acylamino 

10 (including alkylcarbonylamino, arylcarbonylamino, carbamoyl and ureido), amidino, imino, 
sulfliydryl, alkylthio, arylthio, thiocarboxylate, sulfates, alkylsulfmyl, sulfonato, sulfamoyl, 
sulfonamido, nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkylaryl, or an aromatic or 
heteroaromatic moieties. Examples of halogen substituted alkoxy groups include, but are not 
limited to, fluoromethoxy, difluoromethoxy, trifluoromethoxy, chloromethoxy, 

1 5 dichloromethoxy , trichloromethoxy, etc. 

The term "heteroatom" includes atoms of any element other than carbon or hydrogen. 
Preferred heteroatoms are nitrogen, oxygen, sulfur and phosphorus. 

The term "hydroxy" or "hydroxyl" mcludes groups with an -OH or -O" (with an 
appropriate counterion). 

20 The term "halogen" includes fluorine, bromine, chlorine, iodine, etc. The term 

"perhalogenated" generally refers to a moiety wherein all hydrogens are replaced by halogen 
atoms. 

The term "substituted" includes substituents which can be placed on the moiety and 
which allow the molecule to perform its intended function. Examples of substituents include 

25 alkyl, alkenyl, alkynyl, aryl, (CR'R'VsNR'R', (CR'ROo-sCN, NO^, halogen, 
(CR'R')o.3C(halogen)3, (CR'R')o.3CH(halogen)2, (CR'R' VaCHsOialogen), 
(CR'R'VaCONR^R', (CR'ROo.3S(0),.2NR'R% (CR^R'VsCHO, (CR'R')o.30(CR'R')o-3H, 
(CR'ROo.3S(0)o.2R', (CR'R')o.30(CR'ROa-3H, (CR'R')o-3COR\ (CR'ROo^CO^R', or 
(CR'R')o-30R' groups; wherein each R' and R' are each independently hydrogen, a 

30 C,-C5 alkyl, C2-C5 alkenyl, C2-C5 alkynyl, or aryl group, or R' and R' taken together are a 
benzylidene group or a -(CH2)20(CH2)2- group. 

The term "amine" or "amino" includes compounds or moieties in which a nitrogen 
atom is covalently bonded to at least one carbon or heteroatom. The term "alkyl amino" 
includes groups and compounds wherein the nitrogen is bound to at least one additional alkyl 

35 group. The term "dialkyl amino" includes groups wherein the nitrogen atom is bound to at 
least two additional alkyl groups. 
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The term "ether" includes compounds or moieties which contain an oxygen bonded to 
two different carbon atoms or heteroatoms. For example, the term includes "alkoxyalkyl" 
which refers to an alkyl, alkenyl, or alkynyl group covalently bonded to an oxygen atom 
which is covalently bonded to another alkyl group. 
5 The term "ester" includes compounds and moieties which contain a carbon or a 

heteroatom bound to an oxygen atom which is bonded to the carbon of a carbonyl group. The 
term "ester" includes alkoxycarboxy groups such as methoxycarbonyl, ethoxycarbonyl, 
propoxycarbonyl, butoxycarbonyl, pentoxycarbonyl, etc. 

The term "base" includes the known purine and pyrimidine heterocyclic bases, 
1 0 deazapurines, and analogs (including heterocycl substituted analogs, e.g. , aminoethyoxy 
phenoxazine), derivatives (e.g., 1-alkenyl-, 1 -alkynyl-, heteroaromatic-, and 1 -alkynyl 
derivatives) and tautomers thereof Examples of purines include adenine, guanine, inosine, 
diaminopurme, and xanthine and analogs {e.g, 8-oxo-N^'methyladenine or 7-diazaxanthine) 
and derivatives thereof. Pyrimidines include, for example, thymine, uracil, and cytosine, and 
1 5 their analogs {e.g , 5-methylcytosine, 5-methyluracil, 5-(l-propynyl)uracil, 5-(l - 

propynyl)cytosine and 4,4-ethanocytosine). Other examples of suitable bases include non- 
purinyl and non-pyrimidinyl bases such as 2-aminopyridine and triazines. 

The term "nucleoside" includes bases which are covalently attached to a sugar moiety, 
preferably ribose or deoxyribose. Examples of preferred nucleosides include ribonucleosides 
20 and deoxyribonucleosides. Nucleosides also include bases linked to amino acids or amino 
acid analogs which may comprise free carboxyl groups, free amino groups, or protecting 
groups. Suitable protecting groups aie well known in the art (see P.G.M. Wuts and T.W. 
Greene, "Protective Groups in Organic Synthesis", 2"^ Ed., Wiley-Interscience, New York, 
1999). 

25 The term "nucleotide" includes nucleosides which further comprise a phosphate group 

or a phosphate analog. 

In a preferred embodiment, the nucleomonomers of an oligonucleotide of the 
invention are RNA nucleotides. In another preferred embodunent, the nucleomonomers of an 
oligonucleotide of the invention are modified RNA nucleotides. 
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As used herein, the term "linkage" includes a naturally occurring, uiunodified 
phosphodiester moiety (-O-CPOjO-O-) that covalently couples adjacent nucleomonomers. As 
used herein, the term "substitute linkage" includes any analog or derivative of the native 
phosphodiester group that covalently couples adjacent nucleomonomers. Substitute linkages 
5 include phosphodiester analogs, e.g., such as phosphorothioate, phosphorodithioate, and P- 
ethyoxyphosphodiester, P-ethoxyphosphodiester, P-alkyloxyphosphotriester, 
methylphosphonate, and nonphosphorus containing linkages, e.g., such as acetals and amides. 
Such substitute linkages are known in the art (e.g, Bjergarde et al 1 991 . Nucleic Acids Res. 
19:5843; Caruthers et al 1991. Nucleosides Nucleotides. 10:47). 

10 Oligonucleotides of the invention comprise 3' and 5' termini (except for circular 

oligonucleotides). The 3' and 5' termini of an oligonucleotide can be substantially protected 
from nucleases e.g., by modifying the 3' or 5' linkages {e,g, U.S. patent 5,849,902 and WO 
98/13526). For example, oligonucleotides can be made resistant by the inclusion of a 
"blockmg group." The term "blocking group" as used herem refers to substituents {e.g , 

1 5 other than OH groups) that can be attached to oligonucleotides or nucleomonomers, either as 
protecting groups or coupling groups for synthesis (e.g., hydrogen phosphonate, 
phosphoramidite, or POj^")- "Blocking groups" also include "end blocking groups" or 
"exonuclease blocking groups" which protect the 5' and 3' termini of the oligonucleotide, 
including modified nucleotides and non-nucleotide exonuclease resistant structures. 

20 Exemplary end-blocking groups include cap structures {e.g. , a 7-methylguanosine 

cap), mverted nucleomonomers, e.g, with 3'-3' or 5'-5' end inversions (see e.g, Ortiagao et 
al 1992. Antisense Res. Dev. 2:129), methylphosphonate, phosphoramidite, non-nucleotide 
groups (e.g. , non-nucleotide linkers, amino linkers, conjugates) and the like. The 3 ' terminal 
nucleomonomer can comprise a modified sugar moiety. The 3' terminal nucleomonomer can 

25 comprise a 3'-0 that can optionally be substituted by a blocking group that prevents 3'- 
exonuclease degradation of the oligonucleotide. For example, the 3'-hydroxyl can be 
esterified to a nucleotide through a 3'->3' intemucleotide linkage. For example, the alkyloxy 
radical can be methoxy, ethoxy, or isopropoxy, and preferably, ethoxy. Optionally, the 3'->3' 
linked nucleotide at the 3' terminus can be linked by a substitute linkage. To reduce nuclease 

30 degradation, the 5' most 3'-)-5' linkage can be a modified linkage, e.g. , a phosphorothioate or 
a P-alkyloxyphosphotriester linkage. Preferably, the two 5' most 3'->5' linkages are 
modified linkages. Optionally, the 5' terminal hydroxy moiety can be esterified with a 
phosphorus containing moiety, e.g., phosphate, phosphorothioate, or P-ethoxyphosphate. 

In one embodiment, an oligonucleotide may comprise a 5' phosphate group or a group 
3 5 larger than a phosphate group. 
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In one embodiment, the oligonucleotides included in the composition are high affinity 
oligonucleotides. The term "high affinity" as used herein includes oligonucleotides that have 
a Tm (melting temperature) of or greater than about 60^C, greater than about 65°C, greater 
than about 70^C, greater than about 75°C, greater than about 80 °C or greater than about 85 

5 °C. The Tm is the midpoint of the temperature range over which the oligonucleotide 

separates fi-om the target nucleotide sequence. At this temperature, 50% helical (hybridized) 
versus coil (unhybridized) forms are present. Tm is measuied by using the UV spectrum to 
deteraiine the formation and breakdown (melting) of hybridization. Base stacking occurs 
during hybridization, which leads to a reduction in UV absorption. Tm depends both on GC 

1 0 content of the two nucleic acid molecules and on the degree of sequence complementarity. 
Tm can be determined usinfe techniques that are known in the art (see for example, Monia et 
al 1993. 1 Biol Chem. 268:145; Chiang e/^/. 1991. 1 Biol Chem. 266:18162; Gagnor er 
al Nucleic Acids Res. 15:10419; Monia a/. 1996. Proc, Natl Acad. Sci. 93:15481; 
Publisis and Tinoco. 1989. Methods in Enzymology 180:304; Tliuonge/^i/. 1987. Proc. 

15 Natl Acad Set USA 84:5129). 

One skilled in the art will recognize that the length of an RNAi oligonucleotide 
corresponds to a region of complementarity to the target in the antisense stranded, and the 
RNAi may be longer, if, for example the RNAi is of a hairpin design. 

In one embodiment, an oligonucleotide can include an agent which increases the 
20 affinity of the oligonucleotide for its target sequence. The term "affinity enhancing agent" 
includes agents that increase the afifmity of an oligonucleotide for its target. Such agents 
include, e.g., intercalating agents and high affinity nucleomonomers. Intercalating agents 
interact strongly and nonspecifically with nucleic acids. Intercalating agents serve to stabilize 
RNA-DNA duplexes and thus increase the affmity of the oligonucleotides for their targets. 
25 Intercalating agents are most commonly linked to the 3' or 5' end of oligonucleotides. 

Examples of intercalatmg agents include: acridine, chlorambucil, benzopyridoquinoxaline, 
benzopyridoindole, benzophenanthridine, and phenazmium. The agents may also impart 
other characteristics to the oligonucleotide, for example, increasing resistance to 
endonucleases and exonucleases. 
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In one embodiment, a high affinity nucleomonomer is incorporated into an 
oligonucleotide. The language "high affinity nucleomonomer" as used herein includes 
modified bases or base analogs that bind to a complementary base in a target nucleic acid 
molecule with higher affinity than an unmodified base, for example, by having more 

5 energetically favorable interactions with the complementary base, e.g, , by forming more , 
hydrogen bonds with the complementary base. For example, high affinity nucleomonomer 
analogs such as aminoethyoxy phenoxazme (also referred to as a G clamp), which forms four 
hydrogen bonds with guanine are included in the term "high affinity nucleomonomer." A 
high affmity nucleomonomer is illustrated below {see, e.g., Flanagan, et al, 1999. Proc, Natl 

10 Acad. Sci. 96:3513). 




Antisense ON 



(Le., guanine and aminoethyoxy phenoxazine) 

15 Other exemplary high affmity nucleomonomers are known in the art and include 7- 

alkenyl, 7-alkynyl, 7-heteroaromatic-, or 7-alkynyl-heteroaromatic-substituted bases or the 
like which can be substituted for adenosine or guanosine in oligonucleotides (see e.g., U.S. 
patent 5,594,121). Also, 7~substituted deazapurines have been found to impart enhanced 
binding properties to oligonucleotides, i.e,, by allowing them to bind with higher affmity to 

20 complementary target nucleic acid molecules as compared to unmodified oligonucleotides. 
High affinity nucleomonomers can be incorporated into the oligonucleotides of the instant 
invention using standard techniques. 
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In another embodiment, an agent that increases the affinity of an oligonucleotide for 
its target comprises an intercalating agent As used herein the language "intercalating agent" 
includes agents which can bind to a DNA double helix. When covalently attached to an 
oHgonucleotide of the invention, an intercalating agent enhances the binding of the 

5 oligonucleotide to its complementary genomic DNA target sequence. The intercalating agent 
may also increase resistance to endonucleases and exonucleases. Exemplary intercalating 
agents are taught by Helene and Thuong (1989. Genome 3 1 :413), and include e.g., acridine 
derivatives (Lacoste et al 1997. Nucleic Acids Research. 25:1991; Kukreti et al 1997. 
Nucleic Acids Research, 25:4264); quinoline derivatives (Wilson et al 1993. Biochemistry 

10 32:10614); benzo[qquino[3,4-b]quioxaline derivatives (Marchandef a/. 1996. Biochemistry. 
35:5022; Escude et al 1998. Proc, Natl Acad. ScL 95:3591). Intercalating agents can be 
incorporated into an oligonucleotide using any convenient linkage. For example, acridine or 
psoralen can be linked to the oligonucleotide tihrough any available -OH or -SH group, e.g., 
at the terminal 5' position of the oligonucleotide, the 2' positions of sugar moieties, or an OH, 

1 5 NH2, COOH or SH incorporated into the 5-position of pyrimidines using standard methods. 

In one embodiment, when included in an RNase H activating antisense 
oligonucleotide, an agent that increases the affmity of an oligonucleotide for its target is not 
positioned adjacent to an RNase activating region of the oligonucleotide, e.g., is positioned 
adjacent to a non-RNase activating region. Preferably, the agent that increases the affinity of 

20 an oligonucleotide for its target is placed at a distance as far as possible from the RNase 
activating domain of the chimeric antisense oligonucleotide such that the specificity of the 
chimeric antisense oligonucleotide is not altered when compared with the specificity of a 
chimeric antisense oligonucleotide which lacks the intercalating compound. In one 
embodiment, this can be accomplished by positioning the agent adjacent to a non-RNase 

25 activating region. The specificity of the oligonucleotide can be tested by demonstrating that 
transcription of a non-target sequence. Preferably a non-target sequence which is structurally 
similar to the target {e.g., has some sequence homology or identity with the target sequence 
but which is not identical in sequence to the target) is not inhibited to a greater degree by an 
oligonucleotide comprising an affmity enhancing agent directed against the target than by an 

30 oligonucleotide that does not comprise an affinity enhancing agent that is directed against the 
target. 

In one embodhnent, the oligonucleotides of the invention are GC enriched. As used 
herein the term "GC enriched" includes oligonucleotides that have a relatively high percent 
GC content. For example, in one embodiment an oligonucleotide of the invention has at least 
35 about 20%, at least about 30%, at least about 40% GC content. In another embodiment, an 
oligonucleotide of the invention has at least about 50%, at least about 60%, or at least about 
70% GC content. 
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In one embodiment, the oligonucleotides of the invention are at least about 25 
nucleomonomers in length. In one embodiment, the antisense oligonucleotides of the 
invention are greater than about 25 nucleomonomers in length. In one embodiment, an 
antisense oligonucleotide of the invention is at least about 30, at least about 40, at least about 
5 50, or at least about 60, at least about 70, at least about 80, or at least about 90 
nucleomonomers in length. 

Double'Stranded RNA Oligonucleotides 

Double-stranded RNA (double-stranded RNA or RNAi (double-stranded RNA 
1 0 interference)) is a double-stranded RNA oligonucleotide that can be used to inhibit protein 
synthesis in a cell (see, e.g., WO 01/36646A1; Elbashir et al 2001. Genes & Deveolpment 
1 5: 1 88; Elbashir et al 2001 . Nature 41 1 :494; Elbashir et al 2001 EMBO. 20:6877). Double- 
stranded RNA may be formed by a single, self-complementary strand or two separate 
complementary strands. Duplex formation can occur either inside or outside the cell 
1 5 containing the target gene. 

As used herein, the term "double-stranded" includes one or more nucleic acid 
molecules comprising a region of the molecule in which at least a portion of the 
nucleomonomers are complementary and hydrogen bond to form a duplex. 

As used herem, the term "duplex" includes the region of the double-stranded nucleic 
20 acid molecule(s) that is (are) hydrogen bonded to a complementary sequence. 

Accordingly, one aspect of the invention is a method of inhibiting the activity of a 
target gene by introducing an RNAi agent into a cell, such that the dsRNA component of the 
RNAi agent is targeted to the gene. In one embodiment, an RNA oligonucleotide molecule 
may contain at least one nucleomonomer that is a modified nucleotide analogue. The 
25 nucleotide analogues may be located at positions where the target-specific activity, e.g., the 
RNAi mediating activity is not substantially effected, e.g., in a region at the 5 '-end or the 3'- 
end of the double-stranded molecule, where the overhangs may be stabilized by incorporating 
modified nucleotide analogues. 
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In another aspect, double-stranded RNA molecules known in the art can be used in the 
methods of the present invention. Double-stranded RNA molecules known in the art may 
also be modified according to the teachings herein in conjunction with such methods, e.g., by 
using modified nucleomonomers. For example, see U.S. 6,506,559; U.S. 2002/0,173,478 Al; 
5 U.S. 2002/0,086,356 Al ; Shuey, et al , "RNAi: gene-silencing in therapeutic intervention." 
Drug Discov. Today 2002 Oct 15;7(20): 1040-6; Aoki, et al, "Clin. Exp. Pharmacol. Physiol. 
2003 Jan;30(l-2):96-102; Cioca, et al, "RNA interference is a functional pathway with 
therapeutic potential in human myeloid leukemia cell lines. Cancer Gene Ther. 2003 
Feb;10(2):125-33. 

1 0 Further examples of double-stranded RNA molecules include those disclosed in the 

following references: Kawasaki, et al, "Short hairpin type of dsRNAs that are controlled by 
tRNA(Val) promoter significantly induce RNAi-mediated gene silencing in the cytoplasm of 
human cells." Nucleic Acids Res. 2003 Jan 15;31(2):700-7; Cottrell, et al, "Silence of the 
strands: RNA interference in eukaryotic pathogens " Trends Microbiol. 2003 Jan;ll(l):37- 

15 43; Links, "MammaHan RNAi for the masses." Trends Genet 2003 Jan;19(l):9-12; Hamada, 
et al, "Effects on RNA interference in gene expression (RNAi) in cultured mammalian cells 
of mismatches and the introduction of chemical modifications at the 3'-ends of siRNAs." 
Antisense Nucleic Acid Drug Dev. 2002 Oct;12(5):301-9; Links, "RNAi and related 
mechanisms and their potential use for therapy." Curr. Opin. Chem. Biol. 2002 Dec;6(6):829- 

20 34; Kawasaki, et al , "Short hairpin type of dsRN As that are controlled by tRN A(Val) 
promoter significantly induce RNAi-mediated gene silencing in the cytoplasm of human 
cells." Nucleic Acids Res. 2003 Jan 15;31(2):700-7.) 

Double-stranded RNA molecule comprises a nucleotide sequence which is 
substantially identical to at least part of the target gene. In one embodiment, a double- 

25 stranded RNA molecule comprises a nucleotide sequence which is at least about 100 % 
identical to a portion of the target gene. In another embodiment, a double-stranded RNA 
molecule comprises a nucleotide sequence which is at least about 95 % identical to a portion 
of the target gene. In another embodiment, a double-stranded RNA molecule comprises a 
nucleotide sequence which is at least about 90 % identical to a portion of the target gene. In 

30 another embodiment, a double-stranded RNA molecule comprises a nucleotide sequence 
which is at least about 80 % identical to a portion of the target gene. In another embodunent, 
a double-stranded RNA molecule comprises a nucleotide sequence which is at least about 60 
% identical to a portion of the target gene. In another embodiment, a double-stranded RNA 
molecule comprises a nucleotide sequence which is at least about 100 % identical to a portion 

35 of the target gene. 
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To determine the percent identity of two nucleic acid sequences, the sequences are 
aligned for optimal comparison purposes (e.g. , gaps can be introduced in one or both of a first 
and a second amino acid or nucleic acid sequence for optimal alignment and non-identical 
sequences can be disregarded for comparison purposes). In a preferred embodiment, the 

5 length of the target gene sequence aligned for comparison purposes is at least about 25 

nucleotide residues, at least about 50, at least about 100, at least about 150, at least about 200, 
or at least about 300 or more nucleotide residues are aligned. The nucleotides at 
corresponding nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same nucleotide as the corresponding position in the second sequence, then 

10 the molecules are identical at that position. The percent identity between the two sequences 
is a function of the number of identical positions shared by the sequences, taking into account 
the number of gaps, and the length of each gap, which need to be introduced for optimal 
alignment of the two sequences. 

The comparison of sequences and determination of percent identity between two 
1 5 sequences can be accomplished using a mathematical algorithm. In a preferred embodiment, 
the percent identity between two nucleotide sequences is determined using e.g., the GAP 
program in the GCG software package, usmg a NWSgapdna. CMP matrix and a gap weight 
of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In anotiier embodiment, tfie 
percent identity between two nucleotide sequences is determined using the algorithm of E. 
20. Meyers and W. Miller {Comput. Appl Biosci, 4:1 1-17 (1988)) which has been incorporated 
into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length 
penalty of 12 and a gap penalty of 4, 

The nucleic acid sequences of the present invention can further be used as a "query 
sequence" to perform alignments against sequences in public databases. Such searches can 

25 be perfoimed using the NBLAST and XBLAST programs (version 2.0) of Altschul et al 
(1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12. To obtain gapped alignments for 
comparison purposes. Gapped BLAST can be utilized as described in Altschul et al (1997) 
Nucleic Acids Res. 25(1 7):33 89-3402. When utilizing BLAST and Gapped BLAST 

30 programs, the defauh parameters of the respective programs (e.g. , XBLAST and NBLAST) 
can be used. See, e.g. , the NIH internet website. 

In one embodiment, the oligonucleotides of the mvention are identical to a target 
nucleic acid sequence over at least about 80% of the length of the oligonucleotide. In another 
embodiment, oligonucleotides of the invention are identical to a target nucleic acid sequence 
35 over at least about 90-95 % of the length of tiie oligonucleotide. In anotiier embodiment, 

oligonucleotides of the invention are identical to a target nucleic acid sequence over tiie entire 
length of the oligonucleotide. 
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In yet another embodiment, a sequence of a double-stranded RNA molecule of the 
invention hybridizes to at least a portion of the target gene under stringent hybridization 
conditions. As used herem, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences at least 

5 60% complementary to each other typically remain hybridized to each other. Preferably, the 
conditions are such that sequences at least about 70%, more preferably at least about 80%, 
even more preferably at least about 85% or 90% complementary to each other typically 
remam hybridized to each other. Such stringent conditions are known to those skilled in the 
art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. 

10 (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions 
are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45*^0, followed by one 
or more washes in 0.2 X SSC, 0.1% SDS at 50*^C, preferably at 55°C, more preferably at 60^ 
C, and even more preferably at eS^'C. Ranges intermediate to the above-recited values, e.g., 
at 60-65*^0 or at 55-60*'C are also mtended to be encompassed by the present invention. 

1 5 Alternatively, formamide can be included in the hybridization solution, using methods and 
conditions also known in the art. 

Antisense Oligonucleotides 

As used herein, the term "antisense oligonucleotide" includes oligonucleotides which 
20 -comprise a nucleotide sequence which is specifically interferes with the synthesis of the 

target polypeptide. In general, antisense oligonucleotides of the invention bind to the "sense" 
strand of the nucleotide sequence of the target gene (e.g., polynucleotides such as DNA, 
mRNA (including pre-mRNA)) molecules. When antisense oligonucleotides of the invention 
bind to nucleic acid molecules, they can bind to any region of the nucleic acid molecule, 
25 including e.g, introns, exons, 5', or 3' untranslated regions. For example, antisense 

oligonucleotides that work as steric blockers preferentially bind within a splice junction, 5' 
untranslated region, or the start region of a nucleic acid target molecule. Antisense 
oligonucleotides that work by activating RNase H preferably bind vsdthin an intron, an exon, 
the 5' imtranslated region, or the 3' untranslated region of a nucleic acid target molecule. 
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Antisense oligonucleotides of the invention may or may not be complementary to 
their target sequence. Without being limited to any particular mechanism of action, an 
antisense oligonucleotide used in an oligonucleotide composition of the invention that can 
specifically hybridize with a nucleotide sequence within the target gene (i,e,, is 

5 complementary to a nucleotide sequence within the target gene) may achieve its affects based 
on, e.g., (1) binding to target mRNA and stericly blocking the ribosome complex from 
translating the mRNA; (2) binding to target mRNA and triggering mRNA cleavage by RNase 
H; (3) binding to double-stranded DNA in the nucleus and forming a triple helix; (4) 
hybridizing to open DNA loops created by RNA polymerase; (5) interfering with mRNA 

10 sphcing; (6) interfering with transport of mRNA from the nucleus to the cytoplasm; or (7) 
interfering with translation through inhibition of the binding of initiation factors or assembly 
of ribosomal subunits (i.e., at the start codon). 

Without being limited to any particular mechanism of action, the antisense 
oligonucleotides used in an oligonucleotide composition of the invention that can not 

1 5 specifically hybridize with a nucleotide sequence within the target gene (are not 

complementary to a nucleotide sequence within the target gene) may achieve their affects 
based on, e.g., (1) the secondary structure of the oligonucleotide; (2) hybridization to a 
different nucleotide sequence; (3) binding to proteins or other molecules that may affect the 
target gene; or (4) modulating oligonucleotide degradation products which themselves can 

20 affect cellular functions. 

In one embodiment, at least two of the antisense oligonucleotides in an 
oligonucleotide composition of the invention inhibit protein synthesis via the same 
mechanism. In another embodiment, at least two of the antisense oligonucleotides in an 
oligonucleotide composition inhibit protein synthesis via a different mechanism. In yet 

25 another embodiment, all of the antisense oligonucleotides present in an oligonucleotide 
composition inhibit protein synthesis via the same mechanism. The oligonucleotide 
compositions of the present invention may comprise antisense oligonucleotides which rely 
simultaneously on several of these modes of action. 

The antisense oligonucleotides used in an oligonucleotide composition of the 

30 invention may be of any type, e.g., including morpholino oligonucleotides, RNase H 
activating oligonucleotides, or ribozymes. 
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In one embodiment, antisense oligonucleotides of the invention are substantially 
complementary to a target nucleic acid sequence. Percent complementarity is determined 
analogously to percent identity. For example, when a position in a test nucleotide sequence is 
occupied by a nucleotide that is complementary to the corresponding position in the reference 

5 sequence, then the molecules are complementary at that position. In one embodiment, an 
antisense RNA molecule comprises a nucleotide sequence which is at least about 100 % 
complementary to a portion of the target gene. In another embodiment, an antisense RNA 
molecule comprises a nucleotide sequence which is at least about 90 % complementary to a 
portion of the target gene. In another embodiment, an antisense RNA molecule comprises a 

1 0 nucleotide sequence which is at least about 80 % complementary to a portion of the target 
gene. In another embodiment, an antisense RNA molecule comprises a nucleotide sequence 
which is at least about 60 % complementary to a portion of the target gene. In another 
embodiment, an antisense RNA molecule comprises a nucleotide sequence which is at least 
about 100 % complementary to a portion of the target gene. Preferably, no loops greater than 

15 about 8 nucleotides are formed by areas of non-complementarity between the oligonucleotide 
and the target. 

In one embodiment, the antisense oligonucleotides of the invention are 
complementary to a target nucleic acid sequence over at least about 80% of the length of the 
oligonucleotide. In another embodiment, antisense oligonucleotides of the invention are 
20 complementary to a target nucleic acid sequence over at least about 90-95 % of the length of 
the oligonucleotide. In another embodiment, antisense oligonucleotides of the invention are 
complementary to a target nucleic acid sequence over the entire length of the oligonucleotide. 

Antisense oligonucleotides of the invention can be "chimeric oligonucleotides" which 
comprise an RNA-like and a DNA-like region. The language "RNase H activating region" 

25 includes a region of an oligonucleotide, e.g. , a chimeric oligonucleotide, that is capable of 
recruiting RNase H to cleave the target RNA strand to which the oligonucleotide binds. 
Typically, the RNase activating region contains a minimal core (of at least about 3-5, 
typically between about 3-12, more typically, between about 5-12, and more preferably 
between about 5-10 contiguous nucleomonomers) of DNA or DNA-like nucleomonomers. 

30 (See e.g. , US patent 5,849,902). More preferably, the RNase H activating region comprises 
about nine contiguous deoxyribose containing nucleomonomers. Preferably, the contiguous 
nucleomonomers are linked by a substitute linkage, e.g., a phosphorothioate linkage. 
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The language "non-activating region" includes a region of an antisense 
oligonucleotide, e.g., a chimeric oligonucleotide, that does not recruit or activate RNase H. 
Preferably, a non-activating region does not comprise phosphorothioate DNA. The 
oligonucleotides of the invention comprise at least one non-activating region. In one 
5 embodiment, the non-activating region can be stabilized against nucleases or can provide 
specificity for the target by being complementary to the target and forming hydrogen bonds 
with the target nucleic acid molecule, which is to be bound by the oligonucleotide. 

Antisense oligonucleotides of the present invention may include "morpholino 
oligonucleotides." Morpholino oligonucleotides are non-ionic and function by an RNase H- 
10 independent mechanism. Each of the 4 genetic bases (Adenine, Cytosine, Guanine, and 

Thymine/Uracil ) of the morpholino oligonucleotides is linked to a 6-membered morpholine 
ring. Morpholino oligonucleotides are made by joining the 4 different subxmit types by non- 
ionic phosphorodiamidate intersubunit linkages. An example of a 2 subunit morphilio 
oligonucleotide is shown below. 

15 




Morpholmo oligonucleotides have many advantages including complete resistance to 
nucleases (Antisense & Nuc. Acid Drug Dev. 1996. 6:267); predictable targetmg 

20 (Biochemica Biophysica Acta. 1999. 1489:141); reliable activity in cells (Antisense & Nuc. 
Acid Drug Dev. 1997. 7:63); excellent sequence specificity (Antisense & Nuc. Acid Drug 
Dev. 1997. 7:151); minimal non-antisense activity (Biochemica Biophysica Acta. 1999. 
1489:141); and simple osmotic or scrape delivery (Antisense & Nuc. Acid Drug Dev. 1997. 
7:291). Morpholino oligonucleotides are also preferred because of their non-toxicity at high 

25 doses. A discussion of the preparation of morpholino oligonucleotides can be foimd ui 
Antisense & Nuc. Acid Drug Dev. 1997. 7:187. 
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A variety of nucleotides of different lengths may be used. In one embodiment, an 
oligonucleotide of the invention is greater than about 25 nucleomonomers in length. In one 
embodiment, an oligonucleotide of the invention is at least about 10, 12, 14, 16, 18, 20, 22, 
24, 26, 27, 28, 29, 30, at least about 40, at least about 50, or at least about 60, at least about 

5 70, at least about 80, or at least about 90 nucleomonomers in length. In another embodiment, 
an oligonucleotide of the invention is less than about 25 nucleomonomers in length, 
particularly about 21 to 23. In yet another embodiment, an oligonucleotide of the invention is 
about 10, 12, 14, 16, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleomonomers in 
length. In another embodiment, an oligonucleotide of the invention is at most about 26, 27, 

10 28, 29, 30, at most about 40, at most about 50, or at most about 60, at most about 70, at most 
about 80, or at most about 90 nucleomonomers in length. 

Preferred nucleomonomers in some aspects are ribonucleotides, mcluding 2'-0- 
methyl ribonucleotides and other 2'-modified RNA molecules. 

Oligomers of the invention may also comprise a DNA gap or a phosphorothioate 
15 DNA gap. 

In some aspects, the present invention relates to compositions and methods 
comprising at least about 4, 5, 6, 7, 8, 9, or 10 antisense oligonucleotides targeting at least 
four, five, six, seven, eight, nine, or ten different nucleic acid sequences. 
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Selection Of Oligonucleotide Sequences 

Once the target protein is selected and the nucleotide sequence which encodes it is 
determined, the sequence of an oligonucleotide for inclusion in the compositions of the 
invention is determined. The sequence of the target gene is analyzed and oligonucleotides are 

5 chosen by a process including both elimination and selection steps. In one embodiment, 
oligonucleotides which have more than 3 of any nucleotide (A, U, C, or G) occurrmg 
consecutively within the oligonucleotide are eliminated. In another embodiment, 
oligonucleotides having dinucleotide repeats (e.g., AUAU, ACAC, AGAG, UCUC, UGUG, 
or CGCG) are elhninated. In another embodiment, oligonucleotides are chosen that target 

10 nucleotide sequences of the target gene that are preferably at least about 25 nucleotides apart. 
In another embodiment, oligonucleotides are chosen that comprise between 4 and 10 
(inclusive) of each base, such that the base composition of the oligonucleotides is similar. In 
another embodiment, the percentage of bases in the oligonucleotide which are G or C is 
greater than 50%. In one embodiment, when oligonucleotides are designed to be 

1 5 complementary to a chosen target sequence, preferably, they are 1 00% complementary to the 
target sequence. In another embodiment, an oligonucleotide preferably has greater than 2 
mismatches to other, non-target genes. This can be tested by one of ordinary skill in the art, 
e.g., using available alignment programs and public databases, e.g., the National Institutes of 
Healtli internet website. 

20 

Oligonucleotide Compositions of the Invention 

This invention relates to oligonucleotide compositions includmg more than one 
individual oligonucleotide molecule. The individual oligonucleotide molecules of the 
composition target at least one target nucleotide sequence of a single target gene. For 

25 example, in one embodiment, at least two of the oligonucleotides present in the composition 
target the same nucleotide sequence in the same target gene e.g., the oligonucleotides 
comprise different chemistries but target (e.g., specifically hybridize to) the same sequence of 
bases in a target nucleic acid molecule. In another embodiment, at least two of the 
oligonucleotides present in the composition target different nucleotide sequences in the same 

30 target gene (e.g. , the oligonucleotide composition comprises one oligonucleotide targeting a 
nucleotide sequence in the promoter of a gene and another oligonucleotide targeting a 
nucleotide sequence in the portion of the coding sequence of the target nucleic acid molecule 
or the oligonucleotide composition comprises at least two different oligonucleotides that 
target two different nucleotide sequences in the coding region of the target nucleic acid 

35 molecule). 
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The number of oligonucleotides used in an oligonucleotide composition of the 
invention can vary from as few as about 2 oligonucleotides to greater than about 20 
oligonucleotides. In one embodiment, at least about 3-4 different oligonucleotides are used in 
the oligonucleotide composition. In another embodiment, at least about 5-6 different 

5 oligonucleotides are used m the oligonucleotide composition. In a further embodiment, at 
least about 7-8 different oUgonucleotides are used in the oligonucleotide composition. In one 
embodiment, greater than about 8 different oligonucleotides are used in an oligonucleotide 
composition of the invention. In a preferred embodiment, the number of different 
oligonucleotides in the oligonucleotide composition is chosen so as to use the minimum 

10 number of different oligonucleotides that effectively inhibit synthesis of the target protein. 

The different oligonucleotides used in an oligonucleotide composition of the 
invention can each be present at the same concentration or can be present in different 
concentrations. For example, more desirable oligonucleotides (e.g., those that are more 
inexpensive or easier to synthesize) may be present at higher concentrations than less 
1 5 desirable oligonucleotides. 

Preferably, the oligonucleotides in a composition are either all double-stranded RNA 
oligonucleotides or all antisense oligonucleotides. 

It will be understood that the individual oligonucleotides of the mvention can be 
synthesized to comprise different chemistries. For example, in one embodiment, a 

20 composition of the invention can comprise at least one oligonucleotide that is optionally GC 
enriched. In another embodiment, a composition of the invention comprises at least one 
oligonucleotide that binds to its target with high affmity. In another exemplary embodiment, 
a composition of the invention comprises at least one that is at least about 25 
nucleomonomers in length. In one embodiment, an oligonucleotide of the invention 

25 comprises an oligonucleotide that is GC enriched and binds to its target with high affmity. 
Thus, as shown by this example, one of skill in the art will recognize that given the teachings 
of the specification, muhiple variations of the individual oligonucleotides present in 
improved oligonucleotide compositions of the invention can be made. 

30 Making Oligonucleotide Compositions 

In one embodiment, an individual oligonucleotide is not individually tested for its 
ability to inhibit protein synthesis prior to its inclusion into a composition of the invention. 
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In another embodiment, an individual oligonucleotide for inclusion in an 
oligonucleotide composition inhibits protein synthesis by about 20% when tested 
individually. In another embodiment, an individual oligonucleotide for inclusion in an 
oligonucleotide composition inhibits gene expression by about 30% when tested individually. 
5 In another embodiment, an individual oligonucleotide for inclusion in an oligonucleotide 
composition inhibits gene expression by about 40% when tested individually. In another 
embodiment, an mdividual oligonucleotide for inclusion in an oligonucleotide composition 
inhibits gene expression by about 50% when tested individually. In another embodiment, an 
individual oligonucleotide for inclusion in an oligonucleotide composition inhibits gene 
10 expression by about 60% when tested individually. Preferably, an individual oligonucleotide 
for inclusion in an oligonucleotide composition inhibits gene expression by less than about 
40% when tested individually. 

In one embodiment, an oligonucleotide composition of the invention inhibits gene 
expression to an extent that is greater than the level of inhibition of gene expression achieved 
15 by any of the individual oligonucleotides of the oligonucleotide composition acting alone. In 
another embodiment, the oligonucleotide composition achieves a level of inhibition of protein 
synthesis the same as or higher than the level of inhibition achieved by the most effective 
individual oligonucleotide of the composition. In one embodiment, an oligonucleotide 
composition of the present invention is at least about 80% effective at inhibiting gene 
20 expression. In another embodiment, an oligonucleotide composition of the present invention 
is at least about 90%-95% effective at inhibiting gene expression. In another embodiment, an 
oligonucleotide composition of the present invention is at least about 99% effective at 
inhibiting gene expression. 

The subject compositions greatly increase the efficiency of the inhibition of protein 
25 synthesis because the ability of an individual oligonucleotide to inhibit protein synthesis does 
not have to be tested prior to its inclusion in an oligonucleotide composition of the invention. 
Accordingly, only one transfection need be done to effectively inhibit protein synthesis. 
Thus, in one embodiment, an oligonucleotide composition of the invention is contacted with 
a cell or population of cells prior to testing the ability of the individual oligonucleotides of the 
30 composition to inhibit target gene expression. In another embodiment, an oligonucleotide 
composition of the invention is contacted with a cell or population of cells subsequent to 
testing tiie abUity of tiie individual oligonucleotides of tiie composition to inhibit target gene 
expression. 
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To achieve inhibition of gene expression, an oligonucleotide composition of the 
invention is contacted with a cell (or cell lysate). In one embodiment, the oligonucleotides of 
an oligonucleotide composition are contacted with a cell simultaneously. In an alternative 
embodiment, the oligonucleotides of an oligonucleotide composition can be brought into 

5 contact with a cell at different times. For example, at least one of the oligonucleotides can be 
contacted with a cell at a different time from the other oligonucleotides. In yet another 
example, each of the oligonucleotides of an oligonucleotide composition is contacted with a 
cell sequentially so that each of the oligonucleotides of an oligonucleotide composition 
comes into contact with the cell at a different time. As such, the compositions of the instant 

1 0 invention can be formulated for separate administration of the oligonucleotides. Preferably, a 
cell is contacted with oligonucleotides of the invention such that the level of inhibition of 
protein synthesis {e.g., as measured either directly (by measuring the decrease in the amount 
of the target protein produced) or, for example, by measuring the disappearance of a 
phenotype associated with the presence of the target protein, by measuring a reduction in the 

15 amount of mRNA produced from the target gene, or by measuring in increase in the level of 
degradation of the mRNA) is greater than that observed when individual nucleotides of the 
invention are tested individually. 

The number of oligonucleotides used to contact a cell can vary from as few as 2 
oligonucleotides to greater than about 20 oligonucleotides. In one embodiment, at least about 
20 2-3 different oligonucleotides are contacted with a cell. In another embodiment, at least about 
4-5 different oligonucleotides are used to contact the cell. In a further embodiment, at least 
about 6-7 different oligonucleotides are contacted with a cell. 

The ability of an oligonucleotide composition of the invention to inhibit protein 
synthesis can be measured using techniques which are known in the art, for example, by 

25 detecting an inhibition in gene transcription or protein synthesis. For example, Nuclease SI 
mapping can be performed. In another example, Northern blot analysis can be used to 
measure the presence of RNA encoding a particular protein. For example, total RNA can be 
prepared over a cesium chloride cushion {see, e.g., Ausebel et al, eds. 1987. Current 
Protocols in Molecular Biology (Greene & Wiley, New York). Northern blots can then be 

30 made using the RNA and probed {see, e.g.. Id) In another example, the level of the specific 
mRNA produced by the target protem can be measured, e.g., using PGR. In yet another 
example. Western blots can be used to measure the amount of target protein present. In still 
another embodiment, a phenotype influenced by the amount of the protein can be detected. 
Techniques for performing Western blots are well known in the art, ^ee, e.g., Chen et al J, 

35 Biol Chem. 271 :28259. 
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In another example, the promoter sequence of a target gene can be linked to a reporter 
gene and reporter gene transcription (e.g., as described in more detail below) can be 
monitored. Alternatively, oligonucleotide compositions that do not target a promoter can be 
identified by fusing a portion of the target nucleic acid molecule with a reporter gene so that 

5 the reporter gene is transcribed. By monitoring a change in the expression of the reporter 
gene in the presence of the oligonucleotide composition, it is possible to determine the 
effectiveness of the oligonucleotide composition in inhibiting the expression of the reporter 
gene. For example, in one embodiment, an effective oligonucleotide composition will reduce 
the expression of the reporter gene. By incrementally adjusting the concentrations and 

1 0 identities of the oligonucleotides in the oUgonucIeotide composition and monitoring the 
resulting change in reporter gene expression, it is possible to optimize the oligonucleotide 
composition. 

A "reporter gene" is a nucleic acid that expresses a detectable gene product, which 
may be RNA or protein. Detection of mKNA expression may be accomplished by Northern 

1 5 blotting and detection of protein may be accomplished by staining with antibodies specific to 
the protein. Preferred reporter genes produce a readily detectable product. A reporter gene 
may be operably Imked with a regulatory DNA sequence such that detection of the reporter 
gene product provides a measure of the transcriptional activity of the regulatory sequence. In 
preferred embodiments, the gene product of the reporter gene is detected by an intrinsic 

20 activity associated with that product. For instance, the reporter gene may encode a gene 
product that, by enzymatic activity, gives rise to a detectable signal based on color, 
fluorescence, or luminescence. Examples of reporter genes include, but are not limited to, 
those coding for chloramphenicol acetyl transferase (CAT), luciferase, p-galactosidase and 
alkaline phosphatase. 

25 One skilled in the art would readily recognize numerous reporter genes suitable for 

use in the present invention. These include, but are not limited to, chloramphenicol 
acetyltransferase (CAT), luciferase, human growth hormone (hGH), and beta-galactosidase. 
Examples of such reporter genes can be found in F. A. Ausubel et aL, Eds., Current Protocols 
in Molecular Biology, John Wiley & Sons, New York, (1989). Any gene that encodes a 

30 detectable product, e.g., any product having detectable enzymatic activity or against which a 
specific antibody can be raised, can be used as a reporter gene in the present methods. 
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One reporter gene system is the firefly luciferase reporter system. (Gould, S. J., and 
Subramani, S. 1988. Anal. Biocheni., 7:404-408 incorporated herein by reference). The 
luciferase assay is fast and sensitive. In this assay, a lysate of the test cell is prepared and 
combined with ATP and the substrate luciferin. The encoded enzyme luciferase catalyzes a 
5 rapid, ATP dependent oxidation of the substrate to generate a light-emitting product. The 
total light output is measured and is proportional to the amount of luciferase present over a 
wide range of enzyme concentrations. 

CAT is another frequently used reporter gene system; a major advantage of this 
system is that it has been an extensively validated and is widely accepted as a measure of 

10 promoter activity. (Gorman C. M., Moffat, L. F., and Howard, B. H. 1982. MoL Cell. BioL, 
2:1044-1051). In this system, test cells are transfected with CAT expression vectors and 
incubated with the candidate substance within 2-3 days of the initial transfection. Thereafter, 
cell extracts are prepared. The extracts are incubated with acetyl CoA and radioactive 
chloramphenicol. Following the incubation, acetylated chloramphenicol is separated from 

1 5 nonacetylated form by thin layer chromatography. In this assay, the degree of acetylation 
reflects the CAT gene activity with the particular promoter. 

Another suitable reporter gene system is based on immunologic detection of hGH. 
This system is also quick and easy to use. (Selden, R., Burke-Howie, K. Rowe, M. E., 
Goodman, H. M., and Moore, D. D. (1986), Mol. Cell, BioL, 6:3173-3179 incorporated 
20 herem by reference). The hGH system is advantageous in that the expressed hGH polypeptide 
is assayed in the media, rather than in a cell extract. Thus, this system does not require the 
destruction of the test cells. It will be appreciated that the principle of this reporter gene 
system is not limited to hGH but rather adapted for use with any polypeptide for which an 
antibody of acceptable specificity is available or can be prepared. 

25 

Uptake Of Oligonucleotides By Cells 

Oligonucleotides and oligonucleotide compositions are contacted with {le., brought 
into contact with, also referred to herein as administered or delivered to) and taken up by one 
or more cells. The term "cells" includes prokaryotic and eukaryotic cells, preferably 
30 vertebrate cells, and, more preferably, mammalian cells. In a preferred embodiment, the 
oligonucleotide compositions of the invention are contacted with human cells. 
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Oligonucleotide compositions of the invention can be contacted with cells in vitt*o or 
in vivo. Oligonucleotides are taken up by cells at a slow rate by endocytosis, but endocytosed 
oligonucleotides are generally sequestered and not available, e.g., for hybridization to a target 
nucleic acid molecule. In one embodiment, cellular uptake can be facilitated by 
5 electroporation or calcium phosphate precipitation. However, these procedures are only 
useful for in viti^o or ex vivo embodiments, are not convenient and, in some cases, are 
associated with cell toxicity. 

In another embodiment, delivery of oligonucleotides into cells can be enhanced by 
suitable art recognized methods including calcium phosphate, DMSO, glycerol or dextran, 

1 0 electroporation, or by transfection, e.g., using cationic, anionic, or neutral lipid compositions 
or liposomes using methods known in the art (see e.g., WO 90/14074; WO 91/16024; WO 
91/17424; U.S.Patent No. 4,897,355; Bergaii et al 1993. Nucleic Acids Research. 21 :3567). 
Enhanced delivery of oligonucleotides can also be mediated by the use of viruses, polyamine 
or poly cation conjugates using compoimds such as polylysine, protamine, or Nl, N12-bis 

15 (ethyl) spennine (see e.g., Bartzatt, R. etal\9%9. BiotechnoL Appl Biochem. 11:133; Wagner 
E. etal 1992. Proc, Natl Acad. Sci. 88:4255) 

Conjugating Agents 

Conjugating agents bind to the oligonucleotide in a covalent manner. In one 
20 embodiment, oligonucleotides can be derivitized or chennically modified by binding to a 
conjugating agent to facilitate cellular uptake. For example, covalent linkage of a cholesterol 
moiety to an oligonucleotide can improve cellular uptake by 5- to 10- fold which in turn 
unproves DNA binding by about 10- fold (Boutorin et al, 1989, FEBS Letters 254:129-132). 
. Conjugation of octyl, dodecyl, and octadecyl residues enhances cellular uptake by 3-, 4-, and 
25 10- fold as compared to unmodified oligonucleotides (Vlassov et al, 1994, Biochimica et 
Biophysica Acta 1 197:95-108). Similarly, derivatization of oligonucleotides with poly-L- 
lysine can aid oligonucleotide uptake by cells (Schell, 1974, Biochem. Biophys. Acta 340:323, 
and Lemaitre et al, 1987, Proc. Natl Acad. Sci. USA 84:648). 
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Certain protein carriers can also facilitate cellular uptake of oligonucleotides, 
including, for example, serum albumin, nuclear proteins possessing signals for transport to 
tlie nucleus, and viral or bacterial proteins capable of cell membrane penetration. Therefore, 
protein carriers are useful when associated with or linked to the oligonucleotides. 

5 Accordingly, the present invention provides for derivatization of oligonucleotides with 
groups capable of facilitating cellular uptake, including hydrocarbons and non-polar groups, 
cholesterol, long chain alcohols (Le., hexanol), poly-L-lysine and proteins, as well as other 
aryl or steroid groups and polycations having analogous beneficial effects, such as phenyl or 
naphthyl groups, quinoline, anthracene or phenanthracene groups, fatty acids, fatty alcohols 

10 and sesquiterpenes, diterpenes and steroids. A major advantage of using conjugating agents 
is to increase the initial membrane interaction that leads to a greater cellular accumulation of 
oligonucleotides. 

Encapsulating Agents 

15 Encapsulating agents entrap oligonucleotides within vesicles. In another embodiment, 

an oligonucleotide may be associated with a carrier or vehicle, e,g., liposomes or micelles, 
although other carriers could be used, as would be appreciated by one skilled in the art. 
Liposomes are vesicles made of a lipid bilayer having a structure similar to biological 
membranes. Such carriers are used to facilitate the cellular uptake or targeting of the 

20 oligonucleotide, or improve the oligonucleotide's pharmacokinetic or toxicologic properties. 

For example, the oligonucleotides of the present invention may also be administered 
encapsulated in liposomes, pharmaceutical compositions wherein the active ingredient is 
contained either dispersed or variously present in corpuscles consisting of aqueous concentric 
layers adherent to lipidic layers. The oligonucleotides, depending upon solubility, may be 

25 jpresent both in the aqueous layer and in the lipidic layer, or in what is generally termed a 
liposomic suspension. The hydrophobic layer, generally but not exclusively, comprises 
phopholipids such as lecithin and sphingomyelin, steroids such as cholesterol, more or less 
ionic surfactants such as diacetylphosphate, stearylamine, or phosphatidic acid, or other 
materials of a hydrophobic nature. The diameters of the liposomes generally range from about 

30 1 5 nm to about 5 microns. 
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The use of liposomes as drug delivery vehicles offers several advantages. Liposomes 
increase intracellular stability, increase uptake efficiency and unprove biological activity. 
Liposomes are hollow spherical vesicles composed of lipids arranged in a similar fashion as 
those lipids which make up the cell membrane. They have an internal aqueous space for 
5 entrapping water soluble compounds and range in size from 0.05 to several microns in 
diameter. Several studies have shown that liposomes can deliver nucleic acids to cells and 
that the nucleic acids remain biologically active. For example, a liposome delivery vehicle 
originally designed as a research tool, such as Lipofectin, can deliver intact nucleic acid 
molecules to cells. 

10 Specific advantages of using liposomes include the foUov^ng: they are non-toxic and 

biodegradable in composition; they display long circulation half-lives; and recognition 
molecules can be readily attached to their surface for targeting to tissues. Finally, cost- 
effective manufacture of liposome-based pharmaceuticals, either in a liquid suspension or 
lyophilized product, has demonstrated the viability of this technology as an acceptable drug 

15 delivery system. 

Complexin^ Agents 

Complexing agents bind to the oligonucleotide by a strong but non-covalent attraction 
{e.g., an electrostatic, van der Waals, pi-stacking interaction, etc.). In one embodiment, 
20 oligonucleotides of the invention can be complexed with a complexing agent to increase 
cellular uptake of oligonucleotides- An example of a complexing agent includes cationic 
lipids. Cationic lipids can be used to deliver oligonucleotides to cells. 

The term "cationic lipid" includes lipids and synthetic lipids having both polar and 
non-polar domains and which are capable of being positively charged at or around 

25 physiological pH and which bind to polyanions, such as nucleic acids, and facilitate the 
delivery of nucleic acids into cells. In general cationic lipids include saturated and 
unsaturated alkyl and alicyclic ethers and esters of amines, amides, or derivatives thereof 
Straight-chain and branched alkyl and alkenyl groups of cationic lipids can contain, e.g., from 
1 to about 25 carbon atoms. Preferred straight chain or branched alkyl or alkene groups have 

30 six or more carbon atoms. Alicyclic groups mclude cholesterol and other steroid groups. 

Cationic lipids can be prepared with a variety of counterions (anions) including, e.g., CI", Bf , 
r, F", acetate, trifluoroacetate, sulfate, nitrite, and nitrate. 
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Examples of cationic lipids include: polyethylenimine, polyamidoamine (PAMAM) 
starburst dendrimers, Lipofectin (a combination of DOTMA and DOPE), Lipofectase, 
Lipofectamine, DOPE, Cytofectin (Gilead Sciences, Foster City, CA), and Eufectins (JBL, 
San Luis Obispo, CA). Cationic liposomes may comprise the following: N-[l-(2,3- 

5 dioleoloxy)-propyl]-N,N,N-trimethylammonium chloride (DOTMA), N-[l-(2,3-dioleoloxy)- 
propyl]-N,N,N-trimethylammonium methylsulfate (DOTAP), 3p-[N-(N',N'- 
dimethylaminoethane)carbamoyl]cholesterol (DC-Chol), 2,3,-dioIeyloxy-N- 
[2(sperminecarboxaniido)ethyl]-N,N-dimethyl-l-propanaminium trifluoroacetate (DOSPA), 
l,2-dimyristyloxypropyl-3-dmiethyl-hydroxyethyl ammonium bromide; and 

1 0 dimethyldioctadecylammonium bromide (DDAB). The cationic lipid N-(l -(2,3- 

dioleyloxy)propyl)-N,N,N-trimethylammonium chloride (DOTMA), for example, was found 
to increase 1000-fold the antisense effect of a phosophorothioate oligonucleotide. (Vlassov et 
al, 1994, Biochimica et Biophysica Acta 1 197:95-108). Oligonucleotides can also be 
complexed with, e.g., poly (L-lysine) or avidin and lipids may, or may not, be included in this 

1 5 mixture {e.g. , steryl-poly (L-lysine). 

Cationic lipids have been used in the art to deliver oligonucleotides to cells {see, e.g., 
U.S. Patents 5,855,910; 5,851,548; 5,830,430; 5,780,053; 5,767,099; Lewis etal 1996. 
Proc. Natl Acad. Sci USA 93:3176; Hope et al 1998. Molecular Membrane Biology 15:1). 
Other lipid compositions which can be used to facilitate uptake of the instant oligonucleotides 
20 can be used in connection with the claimed methods. In addition to tliose listed supra, other 
lipid compositions are also known in the art and include, e.g., those taught in U.S. patent 
4,235,871; U.S. patents 4,501,728; 4,837,028; 4,737,323. 

In one embodiment lipid compositions can further comprise agents, e.g., viral proteins 
to enhance lipid-mediated transfections of oligonucleotides (Kamata et al 1994. Nucl Acids. 
25 Res. 22:536). In another embodiment, oligonucleotides are contacted with cells as part of a 
composition comprising an oligonucleotide, a peptide, and a lipid as taught, e.g., in U.S. 
patent 5,736,392. Improved lipids have also been described which are serum resistant (Lewis 
et al 1996. Proc. Natl Acad. Set 93:3176). Cationic lipids and other complexing agents act 
to increase the number of oligonucleotides carried into the cell through endocytosis. 
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In another embodiment N-substituted glycine oligonucleotides (peptoids) can be used 
to optimize uptake of oligonucleotides. Peptoids have been used to create cationic lipid-like 
compounds for transfection (Murphy et al 1 998. Proc. Natl Acad. Sci. 95 : 1 5 1 7). Peptoids 
can be synthesized using standard methods (e.g., Zuckermann, R. N., et al. 1992. J. Am. 
5 Chem. Soc. 1 14:10646; Zuckermann, R.N., et al. 1992. Int. J. Peptide Protein Res, 40:497). 
Combinations of cationic lipids and peptoids, liptoids, can also be used to optimize uptake of 
the subject oligonucleotides (Hunag et al 1998. Chemistiy and Biology. 5:345). Liptoids can 
be synthesized by elaborating peptoid oligonucleotides and coupling the amino terminal 
submonomer to a lipid via its amino group (Hunag et al 1998. Chemistry and Biology. 
10 5:345). 

It is known in the art that positively charged amino acids can be used for creating 
highly active cation lipids (Lewis et al 1996. Proc, Natl Acad Sci U.S.A, 93:3176). In one 
f embodiment, a composition for delivering oligonucleotides of the invention comprises a 

number of arginine, lysine, histadine or ornithine residues linked to a lipophilic moiety {see, 
15 e.g., U.S. patent 5,777,153). 

In another, a composition for delivering oUgonucleotides of the invention comprises a 
peptide havuag from between about one to about four basic residues. These basic residues 
can be located, e.g., on the amino terminal, C-terminal, or internal region of the peptide. 
Families of amino acid residues having similar side chains have been defined in the art. 

20 These famiUes include amino acids with basic side chains {e.g. , lysine, arginine, histidine), 
acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., 
glycine (can also be considered non-polar), asparagine, glutamine, serme, threonine, tyrosine, 
cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, 

25 isoleucine) and aromatic side chains (e.g. , tyrosine, phenylalanine, tryptophan, histidine). 
Apart from the basic amino acids, a majority or all of the other residues of the peptide can be 
selected from the non-basic amino acids, e.g., amino acids other than lysine j arginine, or 
histidine. Preferably a preponderance of neutral amino acids with long neutral side chains 
are used. For example, a peptide such as (N-term) His-Ile-Trp-Leu-Ile-Tyr-Leu-Trp-Ile-Val- 

30 (C-term) (SEQ ID NO: ##) could be used. In one embodiment such a composition can 
be mixed with the ftisogenic lipid DOPE as is well known in the art. 
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In one embodiment, the cells to be contacted with an oligonucleotide composition are 
contacted with a mixture comprising the oligonucleotide and a mixture comprising a lipid, 
e.g., one of the lipids or lipid compositions described supra for between about 1 and about 
five days. In one embodiment, the cells are contacted with a mixture comprising a lipid and 
5 the oligonucleotide for between about three days to as long as about 30 days. In another 
embodiment, a mixture comprising a lipid is left in contact with the cells for at least about 
five to about 20 days. In another embodiment, a mixture comprising a lipid is left in contact 
with the cells for at least about seven to about 1 5 days. 

For example, in one embodiment, an oligonucleotide composition can be contacted 
10 with cells in the presence of a lipid such as cytofectin CS or GSV(available from Glen 
Research; Sterling, VA), GS3815, GS2888 for prolonged incubation periods as described 
herein. 

In one embodiment the incubation of the cells with the mixture comprising a lipid and 
an oligonucleotide composition does not reduce the viabiUty of the cells. Preferably, after the 
15 transfection period the cells are substantially viable^ In one embodiment, after transfection, 
the cells are between at least about 70 and at least about 100 percent viable. In another 
embodhnent, the cells are between at least about 80 and at least about 95% viable. In yet 
another embodiment, the cells are between at least about 85% and at least about 90% viable. 

In one embodiment, oligonucleotides are modified by attaching a peptide sequence 
20 that transports the oligonucleotide into a cell, referred to herein as a "transporting peptide." In 
one embodiment, the composition includes an oligonucleotide which is complementary to a 
target nucleic acid molecule encoding the protein, and a covalently attached transporting 
peptide. 

The language "transporting peptide" includes an amino acid sequence that facilitates 
25 the transport of an oligonucleotide into a cell. Exemplary peptides which facilitate the 

transport of the moieties to which they are linked into cells are known in the art, and include, 
e.g., HIV TAT transcription factor, lactoferrin, Herpes VP22 protein, and fibroblast growth 
factor2(Poogae^a/. 1998. Nature Biotechnology. 16:857; and Derossie/ a/. \99S. Trends 
in Cell Biology, 8:84; Elliott and O'Hare. 1997. Cell 88:223). 

30 For example, in one embodiment, the transporting peptide comprises an amino acid 

sequence derived from the antennapedia protein. Preferably, the peptide comprises amino 
acids 43-58 of the antennapedia protein (Arg-GlnTle-Lys-Ile-Trp-Phe-Gln-Asn-Arg-Arg- 
Met-Lys-Trp-Lys-Lys) (SEQ ID NO: ##) or a portion or variant thereof that facilitates 
transport of an oligonucleotide into a cell (see, e.g., WO 91/1898; Derossi et al 1998. Trends 

35 Cell Biol 8:84). Exemplary variants are shown in Derossi et al, supra. 
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In one embodiment, the transporting peptide comprises an amino acid sequence 
derived from the transportan, galanin (l-12)-Lys-mastoparan (1-14) amide, protein. (Pooga 
etal 1998. Nature Biotechnology 16:^57). Preferably, the peptide comprises the amino 
acids of the transportan protein shown in the sequence GWTLNSAGYLLGKINLKA- 
5 LAALAKKIL (SEQ ID NO: ##) or a portion or variant thereof that faciUtates transport 
of an oligonucleotide into a cell. 

In one embodiment, the transporting peptide comprises an amino acid sequence 
derived from the HIV TAT protein. Preferably, the peptide comprises amino acids 37-72 of 
the HIV TAT protein, e.g., shown in the sequence C(Acm)FITKALGISYGRKKRRQR- 

10 RRPPQC (SEQ ID NO: ##) (TAT 37-60; where C(Acm) is Cys-acetamidomethyl) or a 
portion or variant thereof, e.g., C(Acm)GRKKJRRQRRRPPQC (SEQ ID NO: ##) (TAT 
48-40) or C(Acm)LGISYGRKKRRQRRPPQC (SEQ ID NO: ##) (TAT 43-60) that 
facilitates transport of an oligonucleotide into a cell (Vives et al 1997. J. Biol Chem. 
272:16010). In another embodiment the peptide (G)CFITKALGISYGRKKRR- 

15 QRRRPPQGSQTHQVSLSKQ (SEQ ID NO: ##) can be used. 

Portions or variants of transporting peptides can be readily tested to determine 
whether they are equivalent to these peptide portions by comparing their activity to the 
activity of the native peptide, e.g., their ability to transport fluorescently labeled 
oligonucleotides to cells. Fragments or variants that retain the ability of the native 
20 transporting peptide to transport an oligonucleotide into a cell are fimctionally equivalent and 
can be substituted for the native peptides. 

Oligonucleotides can be attached to the transporting peptide using known techniques 
(e.g., Prochiantz, A. 1996. Curr. Opin, NeurobioL 6:629; Derossi etal 1998. Trends Cell 
Biol 8:84; Troy et al 1996. J. NeuroscL 16:253), Vives et al 1997. J. Biol Chem, 

25 272:16010). For example, in one embodiment, oligonucleotides bearing an activated thiol 
group are linked via that thiol group to a cysteine present in a transport peptide (e.g., to the 
cysteine present in the p turn between the second and the third helix of the antennapedia 
homeodomain as taught, e.g., in Derossi et al 1998. Trends Cell Biol 8:84; Prochiantz. 1996. 
Current Opinion in NeurobioL 6:629; AUinquant et al 1995. 1 Cell Biol 128:919). In 

30 another embodiment, a Boc-Cys-(Npys)OH grov^) can be coupled to the transport peptide as 
the last (N-terminal) amino acid and an oligonucleotide bearing an SH group can be coupled 
to the peptide (Troy et al 1996. J. NeuroscL 16:253). 
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In one embodiment, a linking group can be attached to a nucleomonomer and the 
transporting peptide can be covalently attached to the linker. In one embodiment, a linker 
can function as both an attachment site for a transporting peptide and can provide stability 
against nucleases. Examples of suitable linkers include substituted or unsubstituted C,-C2o 
5 alkyl chains, Cj-C^o alkenyl chains, CpCjo alkynyl chains, peptides, and heteroatoms (e.g., S, 
0, NH, etc.). Other exemplary linkers include bifunctional crosslinking agents such as 
sulfosuccinimidyl-4-(maleimidophenyl)-butyrate (SMPB) (see, e.g.. Smith et al Biochem J 
1991.276:417-2). 

In one embodiment, oligonucleotides of the invention are synthesized as molecular 
1 0 conjugates which utilize receptor-mediated endocytotic mechanisms for delivering genes into 
cells {see, e.g., Bunnell et al 1992. Somatic Cell and Molecular Genetics, 18:559 and the 
references cited therein). 

Tar^^etin^ Agents 

1 5 The delivery of oligonucleotides can also be improved by targeting the 

oHgonucleotides to a cellular receptor. The targeting moieties can be conjugated to the 
oligonucleotides or attached to a carrier group (i.e., poly(L-lysme) or liposomes) linked to the 
oligonucleotides. This method is well suited to cells that display specific receptor-mediated 
endocytosis. 

20 For instance, oligonucleotide conjugates to 6-phosphomannosylated proteins are 

internalized 20-fold more efficiently by cells expressing mannose 6-phosphate specific 
receptors than fi-ee oligonucleotides. The ohgonucleotides may also be coupled to a ligand 
for a cellular receptor using a biodegradable linker. In another example, the delivery 
construct is mannosylated streptavidin which forms a tight complex with biotinylated 

25 oligonucleotides. Mannosylated streptavidin was found to increase 20-fold the internalization 
of biotinylated oligonucleotides. (Vlassov etal 1994. Biochimica et Biophysica Acta 
1197:95-108). 

In addition specific ligands can be conjugated to the polylysine component of 
polylysine-based delivery systems. For example, transferrin-polylysine, adenovirus- 
30 polylysine, and influenza virus hemagglutinin HA-2 N-terminal fiisogenic peptides- 

polylysine conjugates greatly enhance receptor-mediated DNA delivery in eucaryotic cells. 
Mannosylated glycoprotein conjugated to poly(L-lysine) in aveolar macrophages has been 
employed to enhance the cellular uptake of oligonucleotides. Liang et al 1999. Pltarmazie 
54:559-566. 
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Because malignant cells have an increased need for essential nutrients such as folic 
acid and transferrin, these nutrients can be used to target oligonucleotides to cancerous cells. 
For example, when folic acid is linked to poly(L-lysine) enhanced oligonucleotide uptake is 
seen in promyelocytic leukaemia (HL-60) cells and human melanoma (M-14) cells. Ginobbi 
5 et al 1997. Anticancer Res. 17:29. In another example, liposomes coated with maleylated 
bovine serum albumm, folic acid, or ferric protoporphyrin EX, show enhanced cellular uptake 
of oligonucleotides In murine macrophages, KB cells, and 2.2.15 human hepatoma cells. 
Liang et al 1 999. Pharmazie 54:559-566. 

Liposomes are naturally targeted to the liver, spleen, and reticuloendothelial system. 
10 By coupling liposomes to various ligands such as antibodies are protein A, they can be 
targeted to specific cell populations. For example, protein A-bearing liposomes may be 
pretreated with H-2K specific antibodies which are targeted to the mouse major 
histocompatibility complex-encoded H-2K protein expressed on L cells. (Vlassov et al 1994. 
Biochimica et Biophysica Acta 1197:95-108). 

15 

Assays of Oligonucleotide Stability 

Preferably, the oligonucleotides of the invention are stabilized, z.e., substantially 
resistant to endonuclease and exonuclease degradation. An oligonucleotide is defined as 
being substantially resistant to nucleases when it is at least about 3-fold more resistant to 
20 attack by an endogenous cellular nuclease, and is highly nuclease resistant when it is at least 
about 6-fold more resistant than a corresponding, unmodified oligonucleotide. This can be 
demonstrated by showmg that the oligonucleotides of the invention are substantially resist 
nucleases using techniques which are known in the art. 

One way in which substantial stability can be demonstrated is showing that the 
25 oligonucleotides of the invention function when delivered to a cell, e.g. , that they reduce 
transcription or translation of target nucleic acid molecules, e.g., by measuring protein levels 
or by measuring cleavage of mRNA. Assays which measure the stability of target KNA can - 
be performed at about 24 hours post-transfection (e.g., using Northern blot techniques, RNase 
Protection Assays, or QC-PCR assays as known in the art). Alternatively, levels of the target 
30 protem can be measured. Preferably, in addition to testmg the RNA or protein levels of 
interest, the RNA or protein levels of a control, non-targeted gene will be measured (e.g., 
actin, or preferably a control with sequence similarity to the target) as a specificity control. 
RNA or protein measurements can be made using any art-recognized technique. Preferably, 
measurements will be made beginning at about 16-24 hours post transfection. (M. Y. Chiang, 
35 etal. 1991. JBiolChem. 266:18162-71; T. Fisher, e/ (3/. 1993. Nucleic Acids Research 
21 3857). 
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The ability of an oligonucleotide composition of the invention to inhibit protein 
synthesis can be measured using techniques which are known in the art, for example, by 
detecting an inhibition in gene transcription or protein synthesis. For example. Nuclease SI 
mapping can be performed. In another example, Northern blot analysis can be used to 

5 measure the presence of RNA encoding a particular protein. For example, total RNA can be 
prepared over a cesium chloride cushion {see, e,g., Ausebel et al, 1987. Current Protocols in 
Molecular Biology (Greene & Wiley, New York)). Northern blots can then be made using 
the RNA and probed (see, e.g., Id.). In another example, the level of the specific mRNA 
produced by the target protein can be measured, e.g., using PCR. In yet another example, 

1 0 Westem blots can be used to measure the amount of target protein present. In still another 
embodiment, a phenotype influenced by the amount of the protein can be detected. 
Techniques for performing Westem blots are well known in the art, ^ee, e.g., Chen et al J. 
Biol Chem. 271:28259. 

In another example, the promoter sequence of a target gene can be linked to a reporter 
1 5 gene and reporter gene transcription (e.g. , as described in more detail below) can be 

monitored. Altematively, oligonucleotide compositions that do not target a promoter can be 
identified by fiising a portion of the target nucleic acid molecule with a reporter gene so that 
the reporter gene is transcribed. By monitoring a change in the expression of the reporter 
gene in the presence of the oligonucleotide composition, it is possible to determine the 
20 effectiveness of the oligonucleotide composition in inhibiting the expression of the reporter 
gene. For example, in one embodiment, an effective oligonucleotide composition will reduce 
the expression of the reporter gene. 

A "reporter gene" is a nucleic acid that expresses a detectable gene product, which 
may be RNA or protein. Detection of mRNA expression may be accomplished by Northern 

25 blotting and detection of protein may be accomplished by staining with antibodies specific to 
the protein. Preferred reporter genes produce a readily detectable product. A reporter gene 
may be operably linked with a regulatory DNA sequence such that detection of the reporter 
gene product provides a measure of the transcriptional activity of the regulatory sequence. In 
preferred embodiments, the gene product of the reporter gene is detected by an intrinsic 

30 activity associated with that product. For instance, the reporter gene may encode a gene 
product that, by enzymatic activity, gives rise to a detectable signal based on color, 
fluorescence, or luminescence. Examples of reporter genes include, but are not limited to, 
those coding for chloramphenicol acetyl transferase (CAT), luciferase, p-galactosidase, and 
alkaline phosphatase. 
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One skilled in the art would readily recognize numerous reporter genes suitable for 
use in the present invention. These include, but are not limited to, chloramphenicol 
acetyltransferase (CAT), luciferase, human growth hormone (hGH), and beta-galactosidase. 
Examples of such reporter genes can be found in F. A. Ausubel et al, Eds., Current Protocols 
5 in Molecular Biology, John Wiley & Sons, New York, (1989). Any gene that encodes a 
detectable product, e.g,, any product having detectable enzymatic activity or against which a 
specific antibody can be raised, can be used as a reporter gene in the present methods. 

One reporter gene system is the firefly luciferase reporter system. (Gould, S. J., and 
Subramani, S. 1988. Anal Biochem., 7:404-408 incorporated herein by reference). The 
10 luciferase assay is fast and sensitive. In this assay, a lysate of the test cell is prepared and 
combined with ATP and the substrate luciferin. The encoded enzyme luciferase catalyzes a 
rapid, ATP dependent oxidation of the substrate to generate a light-emitting product. The 
total light output is measured and is proportional to the amount of luciferase present over a 
wide range of enzyme concentrations. 

1 5 CAT is another jfrequently used reporter gene system; a major advantage of this 

system is that it has been an extensively validated and is widely accepted as a measure of 
promoter activity. (Gorman C. M., Moffat, F., and Howard, B. H. 1982, Mol. Cell. Biol., 
2:1044-1051). In this system, test cells are transfected with CAT expression vectors and 
incubated with the candidate substance within 2-3 days of the initial transfection. Thereafter, 

20 cell extracts are prepared. The extracts are incubated with acetyl CoA and radioactive 

chloramphenicol. Follov^ng the incubation, acetylated chloramphenicol is separated from 
nonacetylated form by thin layer chromatography. In this assay, the degree of acetylation 
reflects the CAT gene activity with the particular promoter. 

Another suitable reporter gene system is based on immunologic detection of hGH. 

25 This system is also quick and easy to use. (Selden, R., Burke-Howie, K. Rowe, M, E., 
Goodman, H. M., and Moore, D. D. (1986), Mol. Cell, Biol., 6:3173-3179 incorporated 
herein by reference). The hGH system is advantageous in that the expressed hGH 
polypeptide is assayed in the media, rather than in a cell extract. Thus, this system does not 
require the destruction of the test cells. It will be appreciated that the principle of this 

30 reporter gene system is not limited to hGH but rather adapted for use with any polypeptide for 
which an antibody of acceptable specificity is available or can be prepared. 
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Oligonucleotide Synthesis 

Oligonucleotides of the invention can be synthesized by any methods known in the 
art, e.g., using enzymatic synthesis and chemical synthesis. The oligonucleotides can be 
synthesized in vitro (e.g., using enzymatic synthesis and chemical synthesis) or in vivo (using 
5 recombinant DNA technology well known in the art. 

In a preferred embodiment, chemical synthesis is used. Chemical synthesis of linear 
oligonucleotides is well known in the art and can be achieved by solution or solid phase 
techniques. Preferably, synthesis is by solid phase methods. Oligonucleotides can be made 
by any of several different synthetic procedures including the phosphoramidite, phosphite 
1 0 triester, H-phosphonate, and phosphotriester methods, typically by automated synthesis 
methods. 

Oligonucleotide synthesis protocols are well known in the art and can be found, e.g., 
m U.S. patent 5,830,653; WO 98/13526; Stec et al 1984. J. Am. Chem. Soc. 106:6077; Stec 
et al 1985. J. Org. Chem, 50:3908; Stec et al J. Chroraatog. 1985. 326:263; LaPlanche et al. 
15 1986. Nuc. Acid. Res. 1986. 14:9081; Fasman G. D., 1989. Practical Handbook of 

Biochemistry and Molecular Biology. 1989. CRC Press, Boca Raton, Fla.; Lamone. 1993. 
Biochem. Soc, Trans, 21:1; U.S. Patent 5,013,830; U.S. Patent 5,214,135; U.S. Patent 
5,525,719; Kawasaki etal 1993. J. Med Chem. 36:831; WO 92/03568; U.S. Patent 
5,276,019; U.S. Patent 5,264,423. 

20 The synthesis method selected can depend on the length of the deshred oligonucleotide 

and such choice is within the skill of the ordinary artisan. For example, the phosphoramidite 
and phosphite triester method produce ohgonucleotides having 175 or more nucleotides while 
the H-phosphonate method works well for oligonucleotides of less than 100 nucleotides. If 
modified bases are incoiporated into the oligonucleotide, and particularly if modified 

25 phosphodiester linkages are used, then the synthetic procedures are altered as needed 
according to known procedures. In this regard, Uhlmann et al. (1990, Chemical Reviews 
90:543-584) provide references and outline procedures for making oligonucleotides with 
modified bases and modified phosphodiester Hnkages. Other exemplary methods for making 
oligonucleotides are taught in Sonveaux. 1994. "Protecting Groups in Oligonucleotide 

30 Synthesis"; Agrawal. Methods in Molecular Biology 26: 1 . Exemplary synthesis methods are 
also taught in "Oligonucleotide Synthesis- A Practical Approach" (Gait, M.J. IRL Press at 
Oxford University Press. 1984). Moreover, linear oligonucleotides of defined sequence can 
be purchased commercially. 
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The oligonucleotides may be purified by polyacrylamide gel electrophoresis, or by 
any of a number of chromatographic methods, including gel chromatography and high 
pressure liquid chromatography. To confirm a nucleotide sequence, oligonucleotides may be 
subjected to DNA sequencing by any of the known procedures, including Maxam and Gilbert 

5 sequencing, Sanger sequencing, capillary electrophoresis sequencing the wandering spot 
sequencing procedure or by using selective chemical degradation of oligonucleotides bound 
to Hybond paper. Sequences of short oligonucleotides can also be analyzed by laser 
desorption mass spectroscopy or by fast atom bombardment (McNeal, et al , 1 982, J. Am, 
Chem. Soc. 104:976; Viari, et al, 1987, Biomed Environ. Mass Spectrom. 14:83; Grotjahn et 

10 al,\ 982, Nuc, Acid Res. 10:4671). Sequencing methods are also available for RNA 
oligonucleotides. 

The quality of oligonucleotides synthesized can be verified by testing the 
oligonucleotide by capillary electrophoresis and denaturing strong anion HPLC (SAX-HPLC) 
using, e.g., the method of Bergot and Egan. 1992. J. Chrom, 599:35. 

1 5 Other exemplary synthesis techniques are well known in the art (see, e.g., Sambrook 

et al , Molecular Cloning: a Laboratory Manual, Second Edition (1 989); DNA Cloning, 
Volumes I and II (DN Glover Ed. 1985); Oligonucleotide Synthesis (MJ Gait Ed, 1984; 
Nucleic Acid Hybridisation (BD Hames and SJ Higgins eds. 1984); A Practical Guide to 
Molecular Cloning (1984); or the series. Methods in Enzymology (Academic Press, Inc.)). 

20 

Uses o f Oligonucleotides 

This invention also features methods of inhibiting expression of a protein in a cell 
including contacting the cell with one of the above-described oligonucleotide compositions. 

The oligonucleotides of the invention can be used in a variety of in vitro and in vivo 
25 situations to specifically inhibit protein expression. The instant methods and compositions 
are suitable for both in vitro and in vivo use. 

In one embodiment, the oligonucleotides of the invention can be used to inhibit gene 
function in viti'o in a method for identifying the functions of genes. In this manner, the 
transcription of genes that are identified, but for which no fimction has yet been shown, can 
30 be inhibited to thereby determine how the phenotype of a cell is changed when the gene is not 
transcribed. Such methods are useful for the validation of genes as targets for clinical 
treatment, e.g., with oligonucleotides or with other therapies. 
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To determine the effect of a composition of the invention, a variety of end points can 
be used. In addition to the assays described previously herein, for example, nucleic acid 
probes {e.g. , in the form of arrays) can be used to evaluate transcription pattems produced by 
cells. Probes can also be used detect peptides, proteins, or protein domains, e.g., antibodies 
5 can be used to detect the expression of a particular protein. In yet another embodiment, the 
function of a protein {e.g., enzymatic activity) can be measured. In yet another embodiment, 
the phenotype of a cell can be evaluated to determine whether or not a target protein is 
expressed. For example, the ability of a composition to affect a phenotype of a cell that is 
associated with cancer can be tested. 

10 In one embodiment, one or more additional agents {e.g., activating agents, inducing 

agents, proliferation enhancing agents, tumor promoters) can be added to the cells. 

In another embodiment, the compositions of the invention can be used to monitor 
biochemical reactions such as, e.g., interactions of proteins, nucleic acids, small molecules, or 
the like— for example the efficiency or specificity of interactions between antigens and 

1 5 antibodies; or of receptors (such as purified receptors or receptors boimd to cell membranes) 
and their ligands, agonists or antagonists; or of enzymes (such as proteases or kinases) and 
their substrates, or increases or decreases in the amount of substrate converted to a product; as 
well as many others. Such biochemical assays can be used to characterize properties of the 
probe or target, or as the basis of a screening assay. For example, to screen samples for the 

20 presence of particular proteases {e.g., proteases involved in blood clotting such as proteases 
Xa and Vila), the samples can be assayed, for example using probes which are fluorogenic 
substrates specific for each protease of interest. If a target protease binds to and cleaves a 
substrate, the substrate will fluoresce, usually as a result, e.g., of cleavage and separation 
between two energy transfer pairs, and the signal can be detected. In another example, to . 

25 screen samples for the presence of a particular kinase(s) (e.g. , a tyrosine kinase), samples 
containing one or more kinases of interest can be assayed, e.g., using probes are peptides 
which can be selectively phosphorylated by one of the kinases of interest. Using art- 
recognized, routinely determinable conditions, samples can be incubated with an array of 
substrates, in an appropriate buffer and with the necessary cofactors, for an empirically 

30 determined period of time. If necessary, reactions can be stopped, e.g., by washing and the 
phosphorylated substrates can be detected by, for example, incubating them with detectable 
reagents such as, e.g., fluorescein-labeled anti-phosphotyrosine or anti-phosphoserine 
antibodies and the signal can be detected. 
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In another embodiment, the compositions of the invention can be used to screen for 
agents which modulate a pattern of gene expression. Arrays of oligonucleotides can be used, 
for example, to identify mRNA species whose pattern of expression from a set of genes is 
correlated with a particular physiological state or developmental stage, or with a disease 
5 condition ("correlative" genes, RNAs, or expression patterns). By the terms "correlate" or 
"correlative," it is meant that the synthesis pattern of RNA is associated with the 
physiological condition of a cell, but not necessarily that the expression of a given RNA is 
responsible for or is causative of a particular physiological state. For example, a small subset 
of mRNAs can be identified which are modulated (e.g., upregulated or dowmegulated) in 
10 cells which serve as a model for a particular disease state. This altered pattern of expression 
as compared to that in a normal cell, which does not exhibit a pathological phenotype, can 
serve as a indicator of the disease state ("indicator" or "correlatvie" genes, RNAs, or 
expression patterns). 

The invention also relates to a selecting oligonucleotides for the methods described 
1 5 herein in which in which many oligomers are screened (e.g. , from about 1 0-20 to 

significantly greater nimibers as may be found in a combinatorial library), after which the 
more efficacious oligomers are chosen and combined to produce a composition of the 
invention. Thus, inhibition of greater than 95%, 90%, 85%, 80%, 70%, or 60% may be 
achieved. 

20 Compositions which modulate the chosen indicator expression pattern (e.g. , compared 

to control compositions comprising, for example oligonucleotides which comprise a 
nucleotide sequence which is the reverse of the oligonucleotide, or which contains mismatch 
bases) can indicate that a particular target gene is a potential target for therapeutic 
intervention. Moreover, such compositions may be useful as therapeutic agents to modulate 

25 expression patters of cells in an in vitro expression system or in in vivo therapy. As used 
herein, "modulate" means to cause to increase or decrease the amoimt or activity of a 
molecule or the like which is involved in a measurable reaction. In one embodiment, a series 
of cells (e.g., fi-om a disease model) can be contacted with a series of agents (e.g., for a period 
of time ranging firom about 10 minutes to about 48 hours or more) and, using routine, art- 

30 recognized methods (e.g. , commercially available kits), total RNA or mRNA extracts can be 
made. If it is desired to amplify the amount of RNA, standard procedures such as RT-PCR 
amplification can be used (see, e.g., Innis et a/.eds., (1996) PGR Protocols: A Guide to 
Methods in Amplification, Academic Press, New York). The extracts (or amplified products 
from them) can be allowed to contact (e.g., incubate with) probes for appropriate indicator 

35 RNAs, and those agents which are associated with a change in the indicator expression 
pattern can be identified. 
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Similarly, agents can be identified which modulate expression patterns associated 
with particular physiological states or developmental stages. Such agents can be man-made or 
naturally-occurring substances, including environmental factors such as substances involved 
in embryonic development or in regulating physiological reactions. 

5 In one embodiment, the methods described herein can be performed in a "high 

throughput" manner, in which a large number of target genes (e.g., as many as about 1000 or 
more, depending on the particular format used) are assayed rapidly and concurrently. 
Further, many assay formats (e.g., plates or surfaces) can be processed at one time. For 
example, because the oligonucleotides of the invention do not need to be tested individually 

1 0 before incorporating them into a composition, they can be readily synthesized and large 
numbers of target genes can be tested at one time. For example, a large number of samples, 
each comprising a biological sample containing a target nucleic acid molecule (e.g., a cell) 
and a composition of the invention can be added to separate regions of an assay format and 
assays can be performed on each of the samples. 

15 

Administration of Oligonucleotide Compositions 

The optimal course of administration or delivery of the oligonucleotides may vary 
depending upon the desired result and/ or on the subject to be treated. As used herein 
"administration" refers to contacting cells with oligonucleotides and can be performed in 
20 vitro or in vivo. The dosage of oligonucleotides may be adjusted to optimally reduce 

expression of a protein translated fi-om a target nucleic acid molecule, e.g., as measured by a 
readout of RNA stability or by a therapeutic response, without undue experimentation. 

For example, expression of the protein encoded by the nucleic acid target can be 
measured to determine whether or not the dosage regimen needs to be adjusted accordingly. 
25 In addition, an increase or decrease in RNA or protein levels m a cell or produced by a cell 
can be measured using any art recognized technique. By determining whether transcription 
has been decreased, the effectiveness of the oligonucleotide in inducing the cleavage of a 
target RNA can be determined. 

Any of the above-described oligonucleotide compositions can be used alone or in 
30 conjunction with a pharmaceutically acceptable carrier. As used herein, "pharmaceutically 
acceptable carrier" includes appropriate solvents, dispersion media, coatings, antibacterial 
and antifungal agents, isotonic and absorption delaying agents, and the like. The use of such 
media and agents for pharmaceutical active substances is well known in the art. Except 
insofar as any conventional media or agent is incompatible with the active ingredient, it can 
35 be used in the therapeutic compositions. Supplementary active ingredients can also be 
incorporated into the compositions. 
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Oligonucleotides may be incorporated into liposomes or liposomes modified with 
polyethylene glycol or admixed with cationic lipids for parenteral administration. 
Incorporation of additional substances into the liposome, for example, antibodies reactive 
against membrane proteins found on specific target cells, can help target the oligonucleotides 
5 to specific cell types. 

Moreover, the present invention provides for administering the subject 
oligonucleotides with an osmotic pump providing continuous infusion of such 
oligonucleotides, for example, as described in Rataiczak, et al (1992 Proc, Natl Acad Set 
USA 89:1 1 823-1 1827). Such osmotic pumps are commercially available, e.g., from Alzet Inc. 
10 (Palo Alto, Calif). Topical administration and parenteral administration in a cationic lipid 
carrier are preferred. 

With respect to in vivo applications, the formulations of the present invention can be 
administered to a patient in a variety of forms adapted to the chosen route of administration, 
e.g., parenterally, orally, or intraperitoneally. Parenteral administration, which is preferred, 
1 5 includes administration by the following routes: intravenous; intramuscular; interstitially ; 
intraarterially; subcutaneous; intra ocular; intrasynovial; trans epithelial, including 
transdermal; pxilmonary via inhalation; ophthalmic; sublingual and buccal; topically, 
including ophthalmic; dermal; ocular; rectal; and nasal inhalation via insufflation. 

Pharmaceutical preparations for parenteral administration include aqueous solutions 
20 of the active compounds in water-soluble or water-dispersible form. In addition, suspensions 
of the active compounds as appropriate oily injection suspensions may be administered. 
Suitable lipophilic solvents or vehicles include fatty oils, for example, sesame oil, or 
synthetic fatty acid esters, for example, ethyl oleate or triglycerides. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension include, 
25 for example, sodium carboxymethyl cellulose, sorbitol, or dextran, optionally, the suspension 
may also contain stabilizers. The oligonucleotides of the invention can be formulated in 
Uquid solutions, preferably in physiologically compatible buffers such as Hank's solution or 
Ringer's solution. In addition, the oligonucleotides may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophilized forms are also included in the 
30 invention. 

Pharmaceutical preparations for topical administration include transdermal patches, 
ointments, lotions, creams, gels, drops, sprays, suppositories, liquids and powders. In 
addition, conventional pharmaceutical carriers, aqueous, powder or oily bases, or thickeners 
may be used in pharmaceutical preparations for topical administration. 
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Pharmaceutical preparations for oral administration include powders or granules, 
suspensions or solutions in water or non-aqueous media, capsules, sachets or tablets. In 
addition, thickeners, flavoring agents, diluents, emulsifiers, dispersing aids, or binders may be 
used in pharmaceutical preparations for oral administration. 

5 For transmucosal or transdermal administration, penetrants appropriate to the barrier 

to be permeated are used in the formulation. Such penetrants are known in the art, and 
include, for example, for transmucosal administration bile salts and fusidic acid derivatives, 
and detergents. Transmucosal administration may be through nasal sprays or using 
suppositories. For oral administration, the oligonucleotides are formulated into conventional 
10 oral administration forms such as capsules, tablets, and tonics. For topical administration, the 
oligonucleotides of the invention are formulated into ointments, salves, gels, or creams as 
known in the art. 

Drug delivery vehicles can be chosen e.g., for in vitro, for systemic, or for topical 
administration. These vehicles can be designed to serve as a slow release reservoir or to 

15 deliver their contents directly to the target cell. An advantage of using some direct delivery 
drug vehicles is that multiple molecvdes are delivered per uptake. Such vehicles have been 
shown to increase the circulation half-life of drugs that would otherwise be rapidly cleared 
from the blood stream. Some examples of such specialized drug delivery vehicles which fall 
into this category are liposomes, hydrogels, cyclodextrins, biodegradable nanocapsules, and 

20 bioadhesive microspheres. 

The described oligonucleotides may be administered systemically to a subject. 
Systemic absorption refers to the entry of drugs into the blood stream followed by 
distribution throughout the entire body. Admmistration routes which lead to systemic 
absorption include: intravenous, subcutaneous, mtraperitoneal, and intranasal. Each of these 

25 administration routes delivers the oligonucleotide to accessible diseased cells. Following 
subcutaneous administration, the therapeutic agent drains into local lymph nodes and 
proceeds through the lymphatic network into the circulation. The rate of entry into the 
circulation has been shown to be a function of molecular weight or size. The use of a 
liposome or other drug carrier localizes the oligonucleotide at the lymph node. The 

30 oligonucleotide can be modified to diffiise into the cell, or the liposome can directly 

participate in the delivery of either the unmodified or modified oligonucleotide into the cell. 

The chosen method of delivery will result in entry into cells. Preferred delivery 
methods include liposomes (10-400 nm), hydrogels, controUed-release polymers, and other 
pharmaceutically applicable vehicles, and microinjection or electroporation (for ex vivo 
35 treatments). 
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The pharmaceutical preparations of the present invention may be prepared and 
formulated as emulsions. Emulsions are usually heterogenous systems of one liquid 
dispersed in another in the form of droplets usually exceeding 0.1 ^im in diameter. 

The emulsions of the present invention may contain excipients such as emulsifiers, 
5 stabilizers, dyes, fats, oils, waxes, fatty acids, fatty alcohols, fatty esters, humectants, 
hydrophilic colloids, preservatives, and anti-oxidants may also be present in emulsions as 
needed. These excipients may be present as a solution in either the aqueous phase, oily phase 
or itself as a separate phase. 

Examples of naturally occurring emulsifiers that may be used in emulsion 
10 formulations of the present invention include lanolin, beeswax, phosphatides, lecithin and 
acacia. Finely divided solids have also been used as good emulsifiers especially in 
combination with surfactants and in viscous preparations. Examples of finely divided solids 
that may be used as emulsifiers include polar inorganic solids, such as heavy metal 
hydroxides, nonswelling clays such as bentonite, attapulgite, hectorite, kaolin, 
1 5 montmorillonite, colloidal aluminum silicate and colloidal magnesium aluminum silicate, 
pigments and nonpolar solids such as carbon or glyceryl tristearate. 

Examples of preservatives that may be included in the emulsion formulations include 
methyl paraben, propyl paraben, quaternary ammonium salts, benzalkonium chloride, esters 
of p'hydroxybenzoic acid, and boric acid. Examples of antioxidants that may be included in 
20 the emulsion formulations mclude fi-ee radical scavengers such as tocopherols, alkyl gallates, 
butylated hydroxyanisole, butylated hydroxytoluene, or reducing agents such as ascorbic acid 
and sodium metabisulfite, and antioxidant synergists such as citric acid, tartaric acid, and 
lecithin. 

In one embodiment, the compositions of oligonucleotides are formulated as 
25 microemulsions. A microemulsion is a system of water, oil and amphiphile which is a single 
optically isotropic and thermodynamically stable liquid solution. Typically microemulsions 
are prepared by first dispersing an oil in an aqueous surfactant solution and then adding a 
sufficient amount of a 4th component, generally an intermediate chain-length alcohol to form 
a transparent system. 
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Surfactants that may be used in the preparation of microemulsions mclude, but are not 
limited to, ionic surfactants, non-ionic surfactants, Brij 96, polyoxyethylene oleyl ethers, 
polyglycerol fatty acid esters, tetraglycerol monolaurate (ML310), tetraglycerol monooleate 
(MO310), hexaglycerol monooleate (PO310), hexaglycerol pentaoleate (PO500), 
5 decaglycerol monocaprate (MCA750), decaglycerol monooleate (MO750), decaglycerol 
sequioleate (S0750), decaglycerol decaoleate (DA0750), alone or in combination with 
cosurfactants. The cosurfactant, usually a short-chain alcohol such as ethanol, 1-propanol, and 
1-butanol, serves to increase the interfacial fluidity by penetrating into the surfactant fibn and 
consequently creating a disordered film because of the void space generated among surfactant 
10 molecules. 

Microemulsions may, hov^ever, be prepared without the use of cosurfactants and 
alcohol-free self-emulsifying microemulsion systems are known in the art. The aqueous phase 
may typically be, but is not limited to, water, an aqueous solution of the drug, glycerol, 
PEG300, PEG400, polyglycerols, propylene glycols, and derivatives of ethylene glycol. The 
15 oil phase may include, but is not limited to, materials such as Captex 300, Captex 355, 
Capmul MCM, fatty acid esters, medium chain (C8-C12) mono, di, and tri-glycerides, 
polyoxyethylated glyceryl fatty acid esters, fatty alcohols, polyglycolized glycerides, 
saturated polyglycolized CS-C 10 glycerides, vegetable oils and silicone oil. 

Microemulsions are particularly of interest from the standpoint of drug solubilization 
20 and the enhanced absorption of drugs. Lipid based microemulsions (both oil/water and 
water/oil) have been proposed to enhance the oral bioavailability of drugs. 

Microemulsions offer improved drug solubilization, protection of drug from 
enzymatic hydrolysis, possible enhancement of drug absorption due to surfactant-induced 
alterations in membrane fluidity and permeability, ease of preparation, ease of oral 

25 administration over soUd dosage forms, improved clinical potency, and decreased toxicity 
(Constantinides et al. Pharmaceutical Research, 1994, 1 1 :1385; Ho et al, J. Pharm. Sci., 
1996, 85:138-143). Microemulsions have also been effective in the transdermal delivery of 
active components in both cosmetic and pharmaceutical applications. It is expected that the 
microemulsion compositions and formulations of the present invention will facilitate the 

30 increased systemic absorption of oligonucleotides from the gastrointestinal tract, as well as 
improve the local cellular uptake of oligonucleotides within the gastrointestinal tract, vagina, 
buccal cavity and other areas of administration. 
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In an embodiment, the present invention employs various penetration enhancers to 
effect the efficient dehvery of nucleic acids^ particularly oligonucleotides, to the skin of 
animals. Even non-lipophilic drugs may cross cell membranes if the membrane to be crossed 
is treated v^ith a penetration enhancer. In addition to increasing the diffusion of non- 
5 lipophilic drugs across cell membranes, penetration enhancers also act to enhance the 
permeability of lipophilic drugs. 

Five categories of penetration enhancers that may be used in the present invention 
include: surfactants, fatty acids, bile salts, chelating agents, and non-chelating non- 
surfactants Other agents may be utilized to enhance the penetration of the administered 
10 oligonucleotides include: glycols such as ethylene glycol and propylene glycol, pyrrols such 
as 2-15 pyrrol, azones, and terpenes such as limonene and menthone. 

The oligonucleotides, especially in lipid formulations, can also be adniinistered by 
coating a medical device, for example, a catheter, such as an angioplasty balloon catheter, 
with a cationic lipid formulation. Coating may be achieved, for example, by dipping the 

1 5 medical device into a lipid formulation or a mixture of a lipid formulation and a suitable 
solvent, for example, an aqueous-based buffer, an aqueous solvent, ethanol, methylene 
chloride, chloroform and the like. An amount of the formulation will naturally adhere to the 
surface of the device which is subsequently administered to a patient, as appropriate. 
Alternatively, a lyophilized mixture of a lipid formulation may be specifically bound to the 

20 surface of the device. Such binding techniques are described, for example, in K. Ishihara et 
al. Journal of Biomedical Materials Research, Vol. 27, pp. 1309-1314 (1993), the disclosures 
of which are incorporated herein by reference in their entirety. 
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The useftd dosage to be administered and the particular mode of administration will 
vary depending upon such factors as the cell type, or for in vivo use, the age, weight and the 
particular animal and region thereof to be treated, the particular oligonucleotide and delivery 
method used, the therapeutic or diagnostic use contemplated, and the form of the formulation, 
5 for example, suspension, emulsion, micelle or liposome, as will be readily apparent to those 
skilled in the art. Typically, dosage is administered at lower levels and increased until the 
desired effect is achieved. When lipids are used to deliver the oligonucleotides, the amount of 
lipid compound that is administered can vary and generally depends upon the amount of 
oligonucleotide agent being administered. For example, the weight ratio of lipid compound to 

1 0 oligonucleotide agent is preferably from about 1 : 1 to about 15:1, with a weight ratio of about 
5:1 to about 10:1 being more preferred. Generally, the amount of cationic lipid compound 
which is administered will vary from between about 0.1 milligram (mg) to about 1 gram (g). 
By way of general guidance, typically between about 0.1 mg and about 10 mg of the 
particular oligonucleotide agent, and about 1 mg to about 100 mg of the lipid compositions, 

1 5 each per kilogram of patient body weight, is administered, although higher and lower 
amounts can be used. 

The agents of the invention are administered to subjects or contacted with cells in a 
biologically compatible form suitable for pharmaceutical administration. By "biologically 
compatible form suitable for administration " is meant that the oligonucleotide is 
20 administered in a form in which any toxic effects are outweighed by the therapeutic effects of 
the oligonucleotide. In one embodiment, oligonucleotides can be administered to subjects. 
Examples of subjects include manamals, e.g., humans, cows, pigs, horses, dogs, cats, mice, 
rats, and transgenic non-human animals. 

Administration of an active amount of an oligonucleotide of the present invention is 
25 defined as an amount effective, at dosages and for periods of time necessary to achieve the 
desired result For example, an active amount of an oligonucleotide may vary according to 
factors such as the type of cell, the oligonucleotide used, and for in vivo uses the disease state, 
age, sex, and weight of the individual, and the ability of the oligonucleotide to elicit a desired 
response in the individual Establishment of therapeutic levels of oligonucleotides within the 
30 cell is dependent upon the rates of uptake and efflux or degradation. Decreasing the degree of 
degradation prolongs the intracellular half-life of the oligonucleotide. Thus, chemically- 
modified oligonucleotides, e.g., with modification of the phosphate backbone, may require 
different dosing. 
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The exact dosage of an oligonucleotide and number of doses administered will depend 
upon the data generated experimentally and in clinical trials. Several factors such as the 
desired effect, the delivery vehicle, disease indication, and the route of administration, will 
affect the dosage. Dosages can be readily determined by one of ordinary skill in the art and 
5 formulated into the subject pharmaceutical compositions. Preferably, the duration of 
treatment will extend at least through the course of the disease symptoms. 

Dosage regima may be adjusted to provide the optimum therapeutic response. For 
example, the oligonucleotide may be repeatedly administered, e.g., several doses may be 
administered daily or the dose may be proportionally reduced as indicated by the exigencies 
10 of the therapeutic situation. One of ordinary skill in the art will readily be able to determine 
appropriate doses and schedules of administration of the subject oligonucleotides, whether the 
oligonucleotides are to be administered to cells or to subjects. 



Treatment of Diseases or Disorders 

15 By inhibiting the expression of a gene, the oligonucleotide compositions of the 

present invention can be used to treat any disease involving the expression of a protein. 
Examples of diseases that can be treated by oligonucleotide compositions include: cancer, 
retinopathies, autoimmune diseases, inflammatory diseases (e.g., ICAM-1 related disorders, 
Psoriasis, Ulcerative Colitus, Crohn's disease), viral diseases (e.g,, HIV, Hepatitis C), and 

20 cardiovascular diseases. 

In one embodiment, in vitro treatment of cells with oligonucleotides can be used for 
ex vivo therapy of cells removed from a subject (e.g., for treatment of leukemia or viral 
infection) or for treatment of cells which did not originate in the subject, but are to be 
administered to the subject (e.g., to eliminate transplantation antigen expression on cells to be 

25 transplanted into a subject). In addition, in vitro treatment of cells can be used in non- 
therapeutic settings, e.g., to evaluate gene function, to study gene regulation and protein 
synthesis or to evaluate improvements made to oligonucleotides designed to modulate gene 
expression or protein synthesis. In vivo treatment of cells can be useful in certain clinical 
settings where it is desirable to inhibit the expression of a protein. There are numerous 

30 medical conditions for which antisense therapy is reported to be suitable (see e.g., U.S. patent 
5,830,653) as well as respiratory syncytial virus infection (WO 95/22553) influenza virus 
(WO 94/23028), and mahgnancies (WO 94/08003). Other examples of clinical uses of 
antisense oUgonucleotides are reviewed, e.g., in Glaser. 1996. Genetic Engineering News 
16:1. Exemplary targets for cleavage by oligonucleotides include e.g., protein kinase Ca, 

35 ICAM-1, c-raf kinase, p53, c-myb, and the bcr/abl fusion gene found in chronic myelogenous 
leukemia. 
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The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the skill of the art. Such techniques 
5 are explained fully in the literature. See^ for example. Molecular Cloning A Laboratory 
Manual, 2nd Ed., ed. by Sambrook, J. et al (Cold Spring Harbor Laboratory Press (1989)); 
Short Protocols in Molecular Biology, 3rd Ed., ed. by Ausubel, F. et al (Wiley, NY (1995)); 
DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M, J. 
Gait ed. (1984)); MuUis et al U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. D. 
10 Hames & S. J. Higgins eds. (1984)); the treatise. Methods In Enzymology (Academic Press, 
Inc., N. Y.); Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, 
eds.. Academic Press, London (1987)); Handbook Of Experimental Immunology, Volumes I- 
IV (D. M. Wen- and C. C. Blackwell, eds. (1 986)); and Miller, J. Experiments in Molecular 
Genetics (Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1972)). 

1 5 The invention is further illustrated by the following examples, which should not be 

construed as further Hmiting. 

EXAMPLES 

20 Example L Ability of Oligonucleotide Compositions to Inhibit CDK2 in A549 Cells. 

In this example, the ability of 5 different antisense oligonucleotides individually was 
compared with the ability of all 5 of the antisense oligonucleotides transfected at one time for 
their ability to inhibit the expression of CDK2 in A549 cells. The sequences of the 5 
antisense oligonucleotides used were: Oligonucleotide 1 GCAGUAUACCUCUCGCU- 

25 CUUGUCAA (SEQ ID NO: ##); oligonucleotide 2 UUUGG AAGUUCUCCAUGAA- 
GCGCCA (SEQ ID NO: ##); oligonucleotide 3 GUCCAAAGUCUGCUA- 
GCUUGAUGGC (SEQ ID NO: ##); oligonucleotide 4 CCCAGGAGGAUUU- 
CAGGAGCUCGGU (SEQ ID NO: ##); oligonucleotide 5 UAGAAGUAACUCCU- 
GGCCACACCAC (SEQ ID NO: ##); reverse control AACUGUUCUCGCUC- 

30 UCCAUAUGACG (SEQ ID NO: ##). 

For transfection with antisense oligonucleotides A549 cells were maintained in 
DMEM with high glucose (Gibco-BRL) supplemented with 10% Fetal Bovine Serum, 2mM 
L-Glutamine, and IX penicillin/streptomycin. 
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On the day before transfection 24-well plates were seeded with 30,000 A549 cells per 
well. The cells were approximately 60% confluent at the start of transfection, and were 
evenly distributed across the plate. On the day of transfection, a lOX stock of Lipofectamine 
2000 (Invitrogen) was prepared in Opti-MEM (serum free media, Gibco-BRL). The diluted 
5 lipid was allowed to stand at room temperature for 15 minutes. The optimal conditions for 
transfection of A549 cells were determined to be 25 nM oligonucleotide complexed with 1 
ug/mL Lipofectamine 2000. A lOX stock of each oligonucleotide to be used in the 
transfection was also prepared in Opti-MEM (lOX concentration of oligonucleotide is 0.25 
uM). Equal volumes of the lOX Lipofectamine 2000 stock and the lOX oligonucleotide 
10 solutions were mixed well and incubated for 1 5 minutes at room temperature to allow 

complexation of the oligonucleotide and lipid. The resulting mixture was 5X. After the 1 5 
minutes of complexation, four volumes of full growth media was added to the 
oligonucleotide/lipid complexes to make a IX solution. The media was aspirated from the 
cells, and 0.5 mL of the IX oligonucleotide/lipid complexes was added to each well. The 
15 cells were not permitted to dry out during the changing of media. The cells were incubated for 
16-24 hours at 3TC in a humidified CO2 incubator. Cell pellets were harvested for protein 
determination or RNA isolation. The Tables below show the results of the experiment. 



Oligonucleotide 


Ratio of CDK2 expression 
to GAPDH expression 


Standard Deviation 


No transfection 


1.481 


0.242 


FITC 


1.004 


0.203 


1 


0.233 


0.041 


2 


0.231 


0.058 


3 


0.198 


0.015 


4 


0.193 


0.065 


5 


0.673 


0.232 


Reverse Control 


0.749 


0.079 


Oligonucleotide 
Composition 


0.137 


0.012 



20 



Oligonucleotide 


Percent Inhibition Compared to Reverse 
Control 


No transfection 


0 (-98%) 


FITC 


0 (-34%) 


1 


69% 
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Oligonucleotide 


Percent Inhibition Compared to Reverse 
Control 


2 


69% 


3 


74% 


4 


74% 


5 


10% 


Reverse Control 


0% 


Oligonucleotide Composition 


82% 



The levels of expression of CDK2 were normalized to levels of GAPDH. No 
transfection or transfection with a fluorescent control oligonucleotide (which targets 
luciferase) showed levels of 1 or higher. A reverse sequence control oligonucleotide gave a 
5 level of about 0.8. Each of the individual oligonucleotides (1-5) showed inhibition in CDK2 
expression (with levels ranging from about 0.2 (about 70% inhibition compared to the reverse 
control) to 0.65 (10% inhibition compared to the reverse control) for oligonucleotide number 
5). All five of the oligonucleotides transfected at once gave a level of less than about 0.2, 
about 82% inhibition compared to the reverse control. Thus, using only one transfection, an 
10 oligonucleotide composition comprising five different antisense oligonucleotides can be used 
to efficiently inhibit protein synthesis. 
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Example 2. Summaiy of Results of Experiments in Which Oligonucleotide Compositions 
Were Tested on Thirty Different Genes. 

Figure 1 shows a smninary of the resuhs of about 30 antisense inhibition experiments 
against about thirty different genes in cell culture. Antisense was transfected as described in 
5 Example 1 and inhibition analyzed by Taqman real time PGR using standard methods. In 
each case the antisense inhibition was determined by comparison to a control oligonucleotide 
of the same chemistry that was not antisense to the target gene. Antisense compositions 
comprised 5-8 antisense oligonucleotides that had been designed against each gene, and 
individual oligonucleotides where compared to the mixtures of 5 or more antisense 

10 oligonucleotides. For three target genes the mixtures did not work well, and these data were 
eliminated from the analysis of the mixtures. Remarkably, the mixtures inhibited 
approximately as well (81-vs 84%) as the best individual oligonucleotide. The average 
inhibition of all individual oligonucleotides was much lower (56%), with a much higher 
variation. Thus, using the mixtures allows one to obtain high inhibition in the vast majority 

1 5 of cases (~90% of tlae target genes) without first screening through individual 

oligonucleotides to select those which work best. Also, as evidenced by the increased 
variation in the results obtained when individual oligonucleotides were used, in many cases 
the mixture was better than the best individual oligonucleotide. 

20 Example 3. Ultramer data for a mixture of siRNA complexes targeting p53, 

HeLa cells were transfected with 50 nM siRNA complexed with 1 ug/mL of 
Lipofectamine 2000 for 24 hours. After 24 hours, cells were lysed and RNA isolated for 
analysis by RT-PGR. Seven siRNA complexes were transfected that target a unique site of 
the p53 gene and a mixture of all seven siRNAs (equal concentrations of each) called the 
25 "siRNA ultramer." The best siRNA complex inhibited the target by 87% and the ultramer 
inhibited 69% compared to average of the controls. 



P53 sequences (Antisense, Sense): 



siRNAl: 



30 



AUGGAGGAGCCGCAGUCAGTT 



CUGACUGCGGCUCCUCCAUTT 



(SEQ ID NO: ##) 
(SEQ ID NO: ##) 



siRNA2: 



ACAUGACGGAGGUUGUGAGTT 



CUCACAACCUCCGUCAUGUTT 



(SEQ ID NO: ##) 
(SEQ ID NO: ##) 



siRNA3: 



35 



GACCAUCGCUAUCUGAGCATT 



(SEQ ID NO: ##) 
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UGCUCAGAUAGCGAUGGUCTT (SEQ ID NO: ##) 
siRNA4: 

GUACAGUCAGAGCCAACCUTT (SEQ ID NO: ##) 
AGGUUGGCUCUGACUGUACTT (SEQ ID NO: ##) 
5 siRNAS: 

ACCUCAAAGCUGUUCCGUCTT (SEQ ID NO: ##) 

GACGGAACAGCUUUGAGGUTT (SEQ ID NO: ##) 
siRNA6: 

CCUCAUUCAGCUCUCGGAATT (SEQ ID NO: ##) 

10 UUCCGAGAGCUGAAUGAGGTT (SEQ ID NO: ##) 
siRNA7: 

CCCUUCUGUCUUGAACAUGTT (SEQ ID NO: ##) 

CAUGUUCAAGACAGAAGGGTT (SEQ ID NO: ##) 



15 Example 4. Ultramer data for a mixture o fsiRNA complexes tar^etim GTP20 . 

Human Mesenchymal Stems cells (hMSC) were transfected with 2 ug/mL 
Lipofectamine 2000 complexed to 400 nM siRNA (total concentration, for clarity in the 
mixture each individual oHgomer was at 80nM). Five siRNA duplexes targeted to GTP20 
(TD), one composition matched control duplex (CD) and an equimolar mixture of each of the 
20 5 oligos ("Mixture") were transfected continuously for 24 hours and RNA was harvested 
using the RNA Catcher (Sequitur, Inc. Natick, MA). Expression of GTP20 mRNA was 
quantified by Taqman and normalized to GAPDH. Inhibition of 70% or greater relative to 
the control duplex was achieved using TD5 (70%) and the Ultramer (76%). 

Himian mesenchymal stem cells were plated at 15,000 per well in 48 well dishes and 
25 transfected 24 hours later. Lipofectamine 2000 was diluted in Opti-MEM to a 1 OX 

concentration of 20 ug/mL and incubated for 15 minutes. Following incubation, lipid was 
complexed to siRNA duplexes by addition of lOX lipid to an equal volume of lOX (4uM) 
siRNA, and incubated for 15 minutes. 5X lipid/siRNA complexes were diluted to IX by the 
addition of MSC Differentiation Media. 250 ul of each IX siRNA treatment was added per 
30 well of 48 well dish. Each treatment was applied to triplicate wells. Osteoblastic 

differentiation of MSC was induced approximately 4 hours after transfection. Cells were 
differentiated for 4 days prior to RNA isolation. 
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Example 5. Ultramer data for a mixture of siRNA complexes targeting Cbfa-L 



Hxrnian Mesenchymal Stems cells (hMSC) were transfected with 2 ug/mL 
Lipofectamine 2000 complexed to 400 nM siRNA (total concentration, in mixture each 
individual duplex was at 80nM). Five targeted duplexes (TD), five control duplexes (CD), 
5 one equimolar mixture of all 5 duplexes ("Mixture") and one control Mixture(UC) were 
transfected continuously for 72 hours. RNA was harvested 96 hours after transfection using 
the RNA Catcher. Expression of Cbfa-1 niRNA was quantified by Taqman and normalized 
to GAPDH. Inhibition of 70% or greater relative to the average of the control duplexes was 
achieved usmg TD4 (74%). The Mixture inhibited 70% relative to the Mixture Control. 

1 0 Human mesenchymal stem cells were plated at 1 5,000 per well in 48 well dishes and 

transfected 24 hours later. Lipofectamine 2000 was diluted in Opti-MEM to a 1 OX 
concentration of 20 ug/mL and incubated for 15 minutes. Following incubation, lipid was 
complexed to siRNA duplexes by addition of lOX lipid to an equal volume of lOX (4uM) 
siRNA, and incubated for 15 minutes. 5X lipid/siRNA complexes were diluted to IX by the 

1 5 addition of MSC Differentiation Media. 250 ul of each IX siRNA treatment was added per 
well of 48 well dish. Each treatment was applied to triplicate wells. Osteoblastic 
differentiation of MSC was induced approximately 4 hours after transfection. Cells were 
differentiated for 4 days prior to RNA isolation. The following antisense sequences of Cbfa- 
1 siRNA duplexes were used (corresponding sense sequences where the complementary. 

20 sequence with a 2nt TT 3 ' overhang, T's are DNA, all other nucleotides are RNA): 



TDl 


(818883) 


: AUUUAAUAGCGUGCUGCCATT 


(SEQ 


ID 


NO: 


##) 


TD2 


(S18885) 


: CUGUAAUCUGACUCUGUCCTT 


(SEQ 


ID 


NO: 


##) 


TD3 


(S18887) 


: AAUAUGGUCGCCAAACAGATT 


(SEQ 


ID 


NO: 


##) 


TD4 


(S18889) 


: GUCAACACCAUCAUUCUGGTT 


(SEQ 


ID 


NO: 


##) 


TD5 


(S18891) 


: AGGUUUAGAGUCAUCAAGCTT 


(SEQ 


ID 


NO: 


##) 


CDl 


(S18884) 


: ACCGUCGUGCGAUAAUUUATT 


(SEQ 


ID 


NO: 


##) 


CD2 


(S18886) 


: CCUGUCUCAGUCUAAUGUCTT 


(SEQ 


ID 


NO: 


##) 


CD3 


(S18888) 


: AGACAAACCGCUGGUAUAATT 


(SEQ 


ID 


NO: 


##) 


CD4 


(S18890) 


: GGUCUUACUACCACAACUGTT 


(SEQ 


ID 


NO: 


##) 


CDS 


{S18892) 


: CGAACUACUGAGAUUUGGATT 


(SEQ 


ID 


NO: 


##) 
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Example 6. Ultramer data for a mixture of siRNA complexes tar^etin^ PTP mu. 



Efficacy of all phosphorothioate DNA 25nt antisense oligonucleotides targeted 
against PTP mu mRNA in human lung carcinoma (A549) cells. Potent inhibition of mRNA 
was obtained following a 16 hour transfection of A549 cells with 25 nM oligo. AS: antisense 
5 oligonucleotide; RC: reverse control; MIX: mixture of individual AS oligomers (total 
oligomer concentration of 25 nM). Target mRNA quantity was normalized to GAPDH. 

A549 cells at passage 3 were plated at 25,000 cells/well in 48 well plates and 
incubated overnight in a humidified 5% COj chamber (37 °C). A 250 nM solution of AS 
oligomer in Optimem-I (Gibco BRL) was mixed with an equal volume of 10 ug/mL 

10 lipofectamine 2000 (InVitrogen) in Optimem-I (lipid solution was pre-incubated at 25C for 
15 minutes). Oligomer-lipid complexes were formed by incubation at room temperature for 
15 minutes. 4 volumes of DMEM plus 10% fetal serum medium was added to the complexes 
and 250 ul of the diluted suspension was added to cells. The final concentration of oligomer 
was 25 nM. Following a 16h transfection, cells ware washed with PBS and poly A+ mRNA 

1 5 was isolated using Sequitur's mRNA Catcher. mRNA was quantified by real time RT-PCR 
(Taqman); automated data collection was with an ABI prism® sequence detection system. 
Data are normalized to GAPDH mRNA. Oligonucleotide sequences: ASl, CAUUCA- 
CCAGCAUGAGAGAACCUGA (SEQ ID NO: ; AS2, TCCCAGAGGCAT- 
TCACCAGCATGAG (SEQ ID NO: ##) ; AS3, UCCAGAUAGGAUUCCC- 

20 CAGUGGCCC (SEQ ID NO: ##) ; AS4, CUGGUCAGGAGCACACUAAUCUCAU 
(SEQ ID NO: ## ) ; AS5, AGUCAAGGUGUUCACUUGCUCCCAA (SEQ ID NO: 
##); AS6, AAGUACUAAUGGCCAGUUCUGCCC (SEQ ID NO: ##);AS7, 
CCCUGUAACCAGAGCCUGUCUCCUG (SEQ ID NO: ##) ; AS8, GAGCUGG- 
UCACCUUGAUUUCCUUCA (SEQ ID NO: ##) ; AS9, CCAGGCAAGUCCCAAGU- 

25 GUCCUCAU (SEQ ID NO: ##) ; ASIO, GAUGUCCUAACACCUUCACCUCAUC 
(SEQ ID NO: ##); MIX, equimolar solution of AS 1 through ASIO. 

Example 7. Ultramer data for a mixture of siRNA complexes targeting PTP-PEST. 

Efficacy of 25nt phosphorothioate DNA antisense oligonucleotides targeted against 
30 PTP-PEST mRNA in Human Umbilical Vein Endothelial Cells (HuVEC). Inhibition of 

mRNA was obtained following a 4 hour serum- free transfection of cells with 200 nM oligo 
followed by a 14 h incubation in serum-containig medium, AS: antisense oligonucleotide; 
RC: reverse control; Mixture: mixture of individual AS oligomers (total oligo concentration 
of 200 nM). Target mRNA quantity is normalized to GAPDH. 
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HuVEC cells at passage 3 were plated at 25,000 cells/well in 48 well plates and 
incubated overnight in a humidified 5% CO2 chamber (37''C). A 2000 nM solution of AS 
oligomer in Optimem-I (Gibco BRL) was mixed with an equal volume of 100 ug/mL 
Lipofectin (Gibco BRL) in Optimem-I (lipid solution was pre-incubated at 25°C for 30 
5 minutes). Oligomer-lipid complexes were formed by incubation at room temperature for 30 
minutes. 4 volumes of Optimem-I (serum-free) was added to the complexes and 250 ul of the 
diluted suspension was added to cells. Four hours later, the transfection complexes were 
aspirated and replaced with 250 ul of EGM-2 complete serum medium 
(Clonetics/Biowhittaker). Following a 16h transfedtion, cells ware washed with PBS and 
10 poly A+ mRNA was isolated using an mRNA Catcher (Sequitur, Inc.). mRNA was 
quantified by real time RT-PCR (Taqman); automated data collection was with an ABI 
prism® sequence detection system. 

Data are normalized to GAPDH mRNA. ASl, CCCAUUGUGGUCAGGAC- 
UCUUCAUGU (SEQ ID NO: ##) ; AS2, UUCCCAUCUCAAAUUCU-CGGCAGGCU 
15 (SEQ ID NO: ##) ; AS3, UGGCACAAAUGGCACCUGUUCUUCCU (SEQ ID NO: 
##);RC,GACUCCUUUAAGUAGGUCUCCCAGGU (SEQ ID NO: ##).M1X, 
equimolar solution of AS 1 , AS2, and AS3. 



Example 8. Ultramer data for a mixture of siRNA complexes targeting PTP-eta 

20 Efficacy of all phosphorothioate DNA 25nt antisense oligonucleotides targeted 

against PTP-eta mRNA in Normal Rat Kidney (NRK) cells. Inhibition of mRNA was 
obtained following an overnight transfection of cells with 25 nM oligo. AS: antisense 
oligonucleotide; RC: reverse control; Mix: mixture of individual AS oligomers (total 
oligomer concentration of 25 nM). Target mRNA quantity is normalized to GAPDH. 

25 NRK cells at passage 5 were plated at 25,000 cells/well in 48 well plates and 

incubated overnight in a humidified 5% C02 chamber (37°C). A 250 nM solution of AS 
oligomer in Optimem-I (Gibco BRL) was mixed with an equal volume of 10 ug/mL 
Lipofectamine 2000 (InVitrogen) in Optimem-I (lipid solution was pre-incubated at 25C for 
30 minutes). Oligomer-lipid complexes were formed by incubation at room temperature for 

30 1 5 minutes. 4 volumes of complete DMEM plus 5% bovine calf serum were added to the 
complexes and 250 ul of the diluted suspension was layered onto cells. The final oligomer 
concentration was 25 nM. Following a 16h incubation, cells ware washed with PBS and poly 
A+ mRNA was isolated using Sequitur's mRNA Catcher*. mRNA was quantified by real 
time RT-PCR (Taqman*); automated data collection was with an ABI prism® sequence 

35 detection system. 
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Data are normalized to GAPDH mKNA. ASl, ACCUGUGCACACAACCUGGC- 
CCUGGU (SEQ ID NO: ##) ; AS2, ACAGUAUACCGCAGCGUGUUUCCCUU 
(SEQ ID NO: ##) ; ASS, GUCUCAUUGACUGUUCCCAAGGUGAU (SEQ ID NO: 
##);AS4,GCUCUACAAUCUGCAUCCGGUAAGAU (SEQ ID NO: ##);AS5, 
5 UCUGUGCCAUCUGCUGCUUGAGAAUU (SEQ ID NO: ##) ; AS6,UGUUCAC- 
AGCUCGGAUGUCAGAAACU (SEQ ID NO: ##) ; RC, UAAGAGUUCGUCGU- 
CUACCGUGUCUU (SEQ ID NO: ##);MIX,equiniolarsolutionof ASl through AS6 

Equivalents 

1 0 Those skilled in the art will recognize, or be able to ascertain using no more than 

routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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CLAIMS 

1. An oligonucleotide composition comprising at least 3 different oligonucleotides 

5 targeted to at least three different nucleotide sequences within a target gene, wherein 

(i) the oligonucleotides bind to their target nucleotide sequence with high affinity and 

(ii) the oligonucleotides are GC enriched. 

2. The oligonucleotide composition of claim 1, wherein the oligonucleotides are 
1 0 antisense oligonucleotides. 

3. The oligonucleotide composition of claim 1, wherein the oligonucleotides are double- 
stranded KNA oligonucleotides. 

1 5 4. The oligonucleotide composition of claim 1 , wherein the oligonucleotide 

compositions bind to their target nucleotide sequence with a Tm of at least about 
60°C. 

5. The oligonucleotide composition of claim 1, wherein the oligonucleotides have a GC 
20 content of at least about 20%. 

6. The oligonucleotide composition of clakn 1 , wherein the composition comprises at 
least about 4 antisense oligonucleotides targeting at least four different nucleic acid 
sequences. 

25 

7. The oligonucleotide composition of clahn 1 , wherein the composition comprises at 
least about 5 oligonucleotides targeting at least five different nucleic acid sequences. 

8. The oligonucleotide composition of claim 1, wherein the composition comprises at 
30 least about 6 oligonucleotides targeting at least six different nucleic acid sequences. 
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9. The oligonucleotide composition of claim 1, wherein the oligonucleotides are at least 
about 25 nucleomonomers in length. 

1 0. The oligonucleotide composition of claim 1 , wherein the oligonucleotides are greater 
5 than 25 nucleomonomers in length. 

1 1 . The oligonucleotide composition of claim 2, wherein at least one of the antisense 
ohgonucleotides is complementary in sequence to its target nucleotide sequence. 

10 12. The oligonucleotide composition of claim 2, wherein the antisense oligonucleotides 
activate RNase H. 

13. The oligonucleotide composition of claim 1, wherein at least one of the 
oligonucleotides comprise at least one modified intemucleoside linkage. 

15 

14. The oligonucleotide composition of claim 1, wherein at least one of the 
oligonucleotides comprise at least one modified sugar moiety. 

15. The oligonucleotide composition of claim 1 , further comprising a pharmaceutically 
20 acceptable carrier. 

16. The oligonucleotide composition of claim 1, wherein the oligonucleotide composition 
achieves a level of inhibition of protein synthesis the same as or higher than the level 
of inhibition achieved by the most effective individual oligonucleotide of the 

25 composition. 

1 7. The oligonucleotide composition of claim 1 , wherein the individual oligonucleotides 
are not separately tested for their ability to inhibit protein synthesis prior to their 
incorporation into the composition. 

30 

18. The oligonucleotide composition of claim 1, wherein the oligonucleotide composition 
results in greater than about 80% inhibition of protein synthesis. 
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1 9. A method of inhibiting protein synthesis in a cell comprising contacting the cell with 
at least 3 different oligonucleotides targeted to at least three different nucleotide 
sequences within a target gene, wherein (i) the oHgonucleotides bind to their target 

5 nucleotide sequence with high aflRnity and (ii) the oligonucleotides are GC enriched, 

to thereby inhibit protein synthesis. 

20, The method of claim 1 9, wherein the oligonucleotides are antisense oHgonucleotides. 

10 21 . The method of claim 1 9, wherein the oligonucleotides are double-stranded RN A 
oligonucleotides. 

22. The method of claim 1 9, wherein the method is performed in a high-throughput 
format. 

15 

23. A method ofidentifying function ofa gene encoding a protein comprising: contacting ^ 
the cell with at least 3 different oligonucleotides targeted to at least three different 
nucleotide sequences within a target gene, wherein (i) the oligonucleotides bind to 
theix target nucleotide sequence with high affinity and (ii) the oligonucleotides are GC 

20 enriched, and assaying for a change in a detectable phenotype in the cell resultmg 

from the inhibition of protein expression, to thereby determine the fiinction of a gene. 

24. The method of claim 23, wherein the oligonucleotides are antisense oligonucleotides. 

25 25. The method of claim 23, wherein the oligonucleotides are double-stranded RNA 
oligonucleotides. 

26. The method of claim 23, wherein the method is perfonned in a high-throughput 
format. 

30 
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27. A method of making the oligonucleotide composition of claim 1, comprising: 
combining at least 3 different oligonucleotides targeted to at least three different 
nucleotide sequences within a target gene, wherein (i) the oligonucleotides bind to 
their target nucleotide sequence with high aflTinity and (ii) the oligonucleotides are GC 

5 enriched, and wherein, the individual oligonucleotides are not separately tested for 

their ability to inhibit protein synthesis prior to their incorporation into the 
composition. 

28. The method of claim 27, wherein the oligonucleotides are antisense oligonucleotides. 

10 

29. The method of claim 27, wherein the oUgonucleotides are double-stranded RNA 
oligonucleotides. 



30. An oligonucleotide composition comprising at least 3 different double-stranded RNA 
1 5 oligonucleotides targeted to at least three dijQferent nucleotide sequences within a 

target gene. 



31 . A method of inhibiting protein synthesis in a cell comprising contacting the cell with 
at least 3 different double-stranded RNA oligonucleotides targeted to at least three 
20 different nucleotide sequences within a target gene. 



32. A method of identifying function of a gene encoding a protein comprising: contacting 
the cell with at least 3 different double-stranded RNA ohgonucleotides targeted to at 
least three different nucleotide sequences within a target gene and assaying for a 
25 change in a detectable phenotype in the cell resulting from the inhibition of protein 

expression, to thereby determine the function of a gene. 



33. A method of making an oligonucleotide composition comprising: combining at least 3 
different double-stranded RNA oligonucleotides targeted to at least three different 
30 nucleotide sequences within a target gene wherein, the individual oligonucleotides are 

not separately tested for their ability to inhibit protein synthesis prior to their 
incorporation into the composition. 
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